
📌 Retain class distribution for seed 5:
Class 0: 4500
Class 1: 4500
Class 2: 4500
Class 3: 4500
Class 4: 4500
Class 5: 4500
Class 6: 4500
Class 7: 4500
Class 8: 4500
Class 9: 4500

📌 Forget class distribution for seed 5:
Class 0: 500
Class 1: 500
Class 2: 500
Class 3: 500
Class 4: 500
Class 5: 500
Class 6: 500
Class 7: 500
Class 8: 500
Class 9: 500

📊 Updated class distribution:
Retain set:
  Class 0: 4625
  Class 1: 4625
  Class 2: 4625
  Class 3: 4625
  Class 4: 4625
  Class 5: 4625
  Class 6: 4625
  Class 7: 4625
  Class 8: 4625
  Class 9: 4625
Forget set:
  Class 0: 375
  Class 1: 375
  Class 2: 375
  Class 3: 375
  Class 4: 375
  Class 5: 375
  Class 6: 375
  Class 7: 375
  Class 8: 375
  Class 9: 375
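
These counts come from a class-balanced, seeded split: 375 forget images per class leave 4625 retained per class, i.e. 46250 retain images in total, which matches the [x/46250] batch counters in the training log below. A minimal sketch of how such a stratified retain/forget partition might be drawn (function and variable names here are illustrative, not from the original script):

import numpy as np

def split_retain_forget(labels, forget_per_class, seed):
    # labels: 1-D array of class ids for the full training set.
    # Draws the same number of forget examples from every class and
    # returns index arrays for the retain and forget subsets.
    rng = np.random.default_rng(seed)
    retain_idx, forget_idx = [], []
    for c in np.unique(labels):
        cls_idx = np.where(labels == c)[0]
        rng.shuffle(cls_idx)
        forget_idx.append(cls_idx[:forget_per_class])
        retain_idx.append(cls_idx[forget_per_class:])
    return np.concatenate(retain_idx), np.concatenate(forget_idx)

# 375 forgotten per class leaves 4625 retained, as in the counts above:
# retain_idx, forget_idx = split_retain_forget(train_labels, 375, seed=5)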
⚠️ Warning: Retain train loader may not be shuffled.
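PyTorch's DataLoader does not shuffle by default, so a loader built over a retain Subset needs shuffle=True set explicitly. A hedged sketch of the fix, reusing the illustrative retain_idx from the split sketch above:

from torch.utils.data import DataLoader, Subset

retain_loader = DataLoader(
    Subset(train_dataset, retain_idx.tolist()),
    batch_size=256,   # matches the 256-sample steps logged below
    shuffle=True,     # off by default; enabling it resolves the warning
    num_workers=4,
)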
Training Epoch: 1 [256/46250]	Loss: 2.4314	LR: 0.000000
Training Epoch: 1 [512/46250]	Loss: 2.4121	LR: 0.000552
Training Epoch: 1 [768/46250]	Loss: 2.4351	LR: 0.001105
Training Epoch: 1 [1024/46250]	Loss: 2.3296	LR: 0.001657
Training Epoch: 1 [1280/46250]	Loss: 2.3175	LR: 0.002210
Training Epoch: 1 [1536/46250]	Loss: 2.1523	LR: 0.002762
Training Epoch: 1 [1792/46250]	Loss: 2.0930	LR: 0.003315
Training Epoch: 1 [2048/46250]	Loss: 1.8616	LR: 0.003867
Training Epoch: 1 [2304/46250]	Loss: 1.6945	LR: 0.004420
Training Epoch: 1 [2560/46250]	Loss: 1.4751	LR: 0.004972
Training Epoch: 1 [2816/46250]	Loss: 1.2824	LR: 0.005525
Training Epoch: 1 [3072/46250]	Loss: 1.0190	LR: 0.006077
Training Epoch: 1 [3328/46250]	Loss: 0.8741	LR: 0.006630
Training Epoch: 1 [3584/46250]	Loss: 0.7148	LR: 0.007182
Training Epoch: 1 [3840/46250]	Loss: 0.4703	LR: 0.007735
Training Epoch: 1 [4096/46250]	Loss: 0.4156	LR: 0.008287
Training Epoch: 1 [4352/46250]	Loss: 0.3754	LR: 0.008840
Training Epoch: 1 [4608/46250]	Loss: 0.2873	LR: 0.009392
Training Epoch: 1 [4864/46250]	Loss: 0.2751	LR: 0.009945
Training Epoch: 1 [5120/46250]	Loss: 0.1992	LR: 0.010497
Training Epoch: 1 [5376/46250]	Loss: 0.2482	LR: 0.011050
Training Epoch: 1 [5632/46250]	Loss: 0.2409	LR: 0.011602
Training Epoch: 1 [5888/46250]	Loss: 0.1308	LR: 0.012155
Training Epoch: 1 [6144/46250]	Loss: 0.1952	LR: 0.012707
Training Epoch: 1 [6400/46250]	Loss: 0.2155	LR: 0.013260
Training Epoch: 1 [6656/46250]	Loss: 0.2568	LR: 0.013812
Training Epoch: 1 [6912/46250]	Loss: 0.2526	LR: 0.014365
Training Epoch: 1 [7168/46250]	Loss: 0.2083	LR: 0.014917
Training Epoch: 1 [7424/46250]	Loss: 0.2358	LR: 0.015470
Training Epoch: 1 [7680/46250]	Loss: 0.2070	LR: 0.016022
Training Epoch: 1 [7936/46250]	Loss: 0.2400	LR: 0.016575
Training Epoch: 1 [8192/46250]	Loss: 0.1926	LR: 0.017127
Training Epoch: 1 [8448/46250]	Loss: 0.2114	LR: 0.017680
Training Epoch: 1 [8704/46250]	Loss: 0.1554	LR: 0.018232
Training Epoch: 1 [8960/46250]	Loss: 0.1795	LR: 0.018785
Training Epoch: 1 [9216/46250]	Loss: 0.2513	LR: 0.019337
Training Epoch: 1 [9472/46250]	Loss: 0.1663	LR: 0.019890
Training Epoch: 1 [9728/46250]	Loss: 0.2553	LR: 0.020442
Training Epoch: 1 [9984/46250]	Loss: 0.2873	LR: 0.020994
Training Epoch: 1 [10240/46250]	Loss: 0.2044	LR: 0.021547
Training Epoch: 1 [10496/46250]	Loss: 0.3559	LR: 0.022099
Training Epoch: 1 [10752/46250]	Loss: 0.2243	LR: 0.022652
Training Epoch: 1 [11008/46250]	Loss: 0.2684	LR: 0.023204
Training Epoch: 1 [11264/46250]	Loss: 0.2114	LR: 0.023757
Training Epoch: 1 [11520/46250]	Loss: 0.3497	LR: 0.024309
Training Epoch: 1 [11776/46250]	Loss: 0.4591	LR: 0.024862
Training Epoch: 1 [12032/46250]	Loss: 0.2597	LR: 0.025414
Training Epoch: 1 [12288/46250]	Loss: 0.3153	LR: 0.025967
Training Epoch: 1 [12544/46250]	Loss: 0.4524	LR: 0.026519
Training Epoch: 1 [12800/46250]	Loss: 0.4767	LR: 0.027072
Training Epoch: 1 [13056/46250]	Loss: 0.4884	LR: 0.027624
Training Epoch: 1 [13312/46250]	Loss: 0.3112	LR: 0.028177
Training Epoch: 1 [13568/46250]	Loss: 0.3148	LR: 0.028729
Training Epoch: 1 [13824/46250]	Loss: 0.3963	LR: 0.029282
Training Epoch: 1 [14080/46250]	Loss: 0.2692	LR: 0.029834
Training Epoch: 1 [14336/46250]	Loss: 0.3586	LR: 0.030387
Training Epoch: 1 [14592/46250]	Loss: 0.3733	LR: 0.030939
Training Epoch: 1 [14848/46250]	Loss: 0.2826	LR: 0.031492
Training Epoch: 1 [15104/46250]	Loss: 0.3111	LR: 0.032044
Training Epoch: 1 [15360/46250]	Loss: 0.3912	LR: 0.032597
Training Epoch: 1 [15616/46250]	Loss: 0.2559	LR: 0.033149
Training Epoch: 1 [15872/46250]	Loss: 0.2992	LR: 0.033702
Training Epoch: 1 [16128/46250]	Loss: 0.2388	LR: 0.034254
Training Epoch: 1 [16384/46250]	Loss: 0.1798	LR: 0.034807
Training Epoch: 1 [16640/46250]	Loss: 0.1808	LR: 0.035359
Training Epoch: 1 [16896/46250]	Loss: 0.2118	LR: 0.035912
Training Epoch: 1 [17152/46250]	Loss: 0.2491	LR: 0.036464
Training Epoch: 1 [17408/46250]	Loss: 0.1735	LR: 0.037017
Training Epoch: 1 [17664/46250]	Loss: 0.1685	LR: 0.037569
Training Epoch: 1 [17920/46250]	Loss: 0.2493	LR: 0.038122
Training Epoch: 1 [18176/46250]	Loss: 0.1386	LR: 0.038674
Training Epoch: 1 [18432/46250]	Loss: 0.1757	LR: 0.039227
Training Epoch: 1 [18688/46250]	Loss: 0.2315	LR: 0.039779
Training Epoch: 1 [18944/46250]	Loss: 0.1231	LR: 0.040331
Training Epoch: 1 [19200/46250]	Loss: 0.2225	LR: 0.040884
Training Epoch: 1 [19456/46250]	Loss: 0.1611	LR: 0.041436
Training Epoch: 1 [19712/46250]	Loss: 0.2523	LR: 0.041989
Training Epoch: 1 [19968/46250]	Loss: 0.1413	LR: 0.042541
Training Epoch: 1 [20224/46250]	Loss: 0.2411	LR: 0.043094
Training Epoch: 1 [20480/46250]	Loss: 0.1652	LR: 0.043646
Training Epoch: 1 [20736/46250]	Loss: 0.1422	LR: 0.044199
Training Epoch: 1 [20992/46250]	Loss: 0.1823	LR: 0.044751
Training Epoch: 1 [21248/46250]	Loss: 0.1398	LR: 0.045304
Training Epoch: 1 [21504/46250]	Loss: 0.1928	LR: 0.045856
Training Epoch: 1 [21760/46250]	Loss: 0.1540	LR: 0.046409
Training Epoch: 1 [22016/46250]	Loss: 0.1368	LR: 0.046961
Training Epoch: 1 [22272/46250]	Loss: 0.1629	LR: 0.047514
Training Epoch: 1 [22528/46250]	Loss: 0.2091	LR: 0.048066
Training Epoch: 1 [22784/46250]	Loss: 0.2141	LR: 0.048619
Training Epoch: 1 [23040/46250]	Loss: 0.1358	LR: 0.049171
Training Epoch: 1 [23296/46250]	Loss: 0.1450	LR: 0.049724
Training Epoch: 1 [23552/46250]	Loss: 0.2240	LR: 0.050276
Training Epoch: 1 [23808/46250]	Loss: 0.1280	LR: 0.050829
Training Epoch: 1 [24064/46250]	Loss: 0.1286	LR: 0.051381
Training Epoch: 1 [24320/46250]	Loss: 0.1933	LR: 0.051934
Training Epoch: 1 [24576/46250]	Loss: 0.1921	LR: 0.052486
Training Epoch: 1 [24832/46250]	Loss: 0.1773	LR: 0.053039
Training Epoch: 1 [25088/46250]	Loss: 0.2499	LR: 0.053591
Training Epoch: 1 [25344/46250]	Loss: 0.1586	LR: 0.054144
Training Epoch: 1 [25600/46250]	Loss: 0.2164	LR: 0.054696
Training Epoch: 1 [25856/46250]	Loss: 0.2405	LR: 0.055249
Training Epoch: 1 [26112/46250]	Loss: 0.2176	LR: 0.055801
Training Epoch: 1 [26368/46250]	Loss: 0.1914	LR: 0.056354
Training Epoch: 1 [26624/46250]	Loss: 0.2211	LR: 0.056906
Training Epoch: 1 [26880/46250]	Loss: 0.2057	LR: 0.057459
Training Epoch: 1 [27136/46250]	Loss: 0.1732	LR: 0.058011
Training Epoch: 1 [27392/46250]	Loss: 0.2527	LR: 0.058564
Training Epoch: 1 [27648/46250]	Loss: 0.2135	LR: 0.059116
Training Epoch: 1 [27904/46250]	Loss: 0.1980	LR: 0.059669
Training Epoch: 1 [28160/46250]	Loss: 0.1839	LR: 0.060221
Training Epoch: 1 [28416/46250]	Loss: 0.1995	LR: 0.060773
Training Epoch: 1 [28672/46250]	Loss: 0.2428	LR: 0.061326
Training Epoch: 1 [28928/46250]	Loss: 0.2191	LR: 0.061878
Training Epoch: 1 [29184/46250]	Loss: 0.2585	LR: 0.062431
Training Epoch: 1 [29440/46250]	Loss: 0.1135	LR: 0.062983
Training Epoch: 1 [29696/46250]	Loss: 0.2157	LR: 0.063536
Training Epoch: 1 [29952/46250]	Loss: 0.1921	LR: 0.064088
Training Epoch: 1 [30208/46250]	Loss: 0.1863	LR: 0.064641
Training Epoch: 1 [30464/46250]	Loss: 0.1763	LR: 0.065193
Training Epoch: 1 [30720/46250]	Loss: 0.2003	LR: 0.065746
Training Epoch: 1 [30976/46250]	Loss: 0.2744	LR: 0.066298
Training Epoch: 1 [31232/46250]	Loss: 0.2360	LR: 0.066851
Training Epoch: 1 [31488/46250]	Loss: 0.1804	LR: 0.067403
Training Epoch: 1 [31744/46250]	Loss: 0.1472	LR: 0.067956
Training Epoch: 1 [32000/46250]	Loss: 0.2456	LR: 0.068508
Training Epoch: 1 [32256/46250]	Loss: 0.1785	LR: 0.069061
Training Epoch: 1 [32512/46250]	Loss: 0.1760	LR: 0.069613
Training Epoch: 1 [32768/46250]	Loss: 0.2304	LR: 0.070166
Training Epoch: 1 [33024/46250]	Loss: 0.1753	LR: 0.070718
Training Epoch: 1 [33280/46250]	Loss: 0.1395	LR: 0.071271
Training Epoch: 1 [33536/46250]	Loss: 0.1230	LR: 0.071823
Training Epoch: 1 [33792/46250]	Loss: 0.1648	LR: 0.072376
Training Epoch: 1 [34048/46250]	Loss: 0.1601	LR: 0.072928
Training Epoch: 1 [34304/46250]	Loss: 0.1233	LR: 0.073481
Training Epoch: 1 [34560/46250]	Loss: 0.2264	LR: 0.074033
Training Epoch: 1 [34816/46250]	Loss: 0.1497	LR: 0.074586
Training Epoch: 1 [35072/46250]	Loss: 0.1624	LR: 0.075138
Training Epoch: 1 [35328/46250]	Loss: 0.1706	LR: 0.075691
Training Epoch: 1 [35584/46250]	Loss: 0.4529	LR: 0.076243
Training Epoch: 1 [35840/46250]	Loss: 0.2945	LR: 0.076796
Training Epoch: 1 [36096/46250]	Loss: 0.4860	LR: 0.077348
Training Epoch: 1 [36352/46250]	Loss: 0.4895	LR: 0.077901
Training Epoch: 1 [36608/46250]	Loss: 0.5064	LR: 0.078453
Training Epoch: 1 [36864/46250]	Loss: 0.5125	LR: 0.079006
Training Epoch: 1 [37120/46250]	Loss: 0.5587	LR: 0.079558
Training Epoch: 1 [37376/46250]	Loss: 0.4001	LR: 0.080110
Training Epoch: 1 [37632/46250]	Loss: 0.2647	LR: 0.080663
Training Epoch: 1 [37888/46250]	Loss: 0.3436	LR: 0.081215
Training Epoch: 1 [38144/46250]	Loss: 0.4323	LR: 0.081768
Training Epoch: 1 [38400/46250]	Loss: 0.3064	LR: 0.082320
Training Epoch: 1 [38656/46250]	Loss: 0.3433	LR: 0.082873
Training Epoch: 1 [38912/46250]	Loss: 0.2968	LR: 0.083425
Training Epoch: 1 [39168/46250]	Loss: 0.3919	LR: 0.083978
Training Epoch: 1 [39424/46250]	Loss: 0.3921	LR: 0.084530
Training Epoch: 1 [39680/46250]	Loss: 0.3712	LR: 0.085083
Training Epoch: 1 [39936/46250]	Loss: 0.3075	LR: 0.085635
Training Epoch: 1 [40192/46250]	Loss: 0.3092	LR: 0.086188
Training Epoch: 1 [40448/46250]	Loss: 0.3810	LR: 0.086740
Training Epoch: 1 [40704/46250]	Loss: 0.2327	LR: 0.087293
Training Epoch: 1 [40960/46250]	Loss: 0.4380	LR: 0.087845
Training Epoch: 1 [41216/46250]	Loss: 0.3533	LR: 0.088398
Training Epoch: 1 [41472/46250]	Loss: 0.2936	LR: 0.088950
Training Epoch: 1 [41728/46250]	Loss: 0.4730	LR: 0.089503
Training Epoch: 1 [41984/46250]	Loss: 0.4220	LR: 0.090055
Training Epoch: 1 [42240/46250]	Loss: 0.3992	LR: 0.090608
Training Epoch: 1 [42496/46250]	Loss: 0.4088	LR: 0.091160
Training Epoch: 1 [42752/46250]	Loss: 0.2648	LR: 0.091713
Training Epoch: 1 [43008/46250]	Loss: 0.3941	LR: 0.092265
Training Epoch: 1 [43264/46250]	Loss: 0.2725	LR: 0.092818
Training Epoch: 1 [43520/46250]	Loss: 0.3273	LR: 0.093370
Training Epoch: 1 [43776/46250]	Loss: 0.3731	LR: 0.093923
Training Epoch: 1 [44032/46250]	Loss: 0.2562	LR: 0.094475
Training Epoch: 1 [44288/46250]	Loss: 0.2896	LR: 0.095028
Training Epoch: 1 [44544/46250]	Loss: 0.2651	LR: 0.095580
Training Epoch: 1 [44800/46250]	Loss: 0.3126	LR: 0.096133
Training Epoch: 1 [45056/46250]	Loss: 0.2746	LR: 0.096685
Training Epoch: 1 [45312/46250]	Loss: 0.3102	LR: 0.097238
Training Epoch: 1 [45568/46250]	Loss: 0.2605	LR: 0.097790
Training Epoch: 1 [45824/46250]	Loss: 0.2464	LR: 0.098343
Training Epoch: 1 [46080/46250]	Loss: 0.2837	LR: 0.098895
Training Epoch: 1 [46250/46250]	Loss: 0.2241	LR: 0.099448
Epoch 1 - Average Train Loss: 0.3784, Train Accuracy: 0.8766
Epoch 1 training time consumed: 335.46s
Evaluating Network.....
Test set: Epoch: 1, Average loss: 0.0006, Accuracy: 0.9489, Time consumed: 23.57s
Saving weights file to checkpoint/retrain/ViT/Friday_18_July_2025_00h_08m_56s/ViT-Cifar10-seed5-ret25-1-best.pth
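
The LR column above rises roughly linearly from 0 to 0.1 across the 181 batches of epoch 1 and then holds at 0.100000 from epoch 2 on, i.e. a one-epoch, per-batch linear warmup to a constant base rate. A self-contained sketch of that schedule using LambdaLR (the model and step counts are stand-ins, not the original code):

import torch
from torch import nn

model = nn.Linear(10, 10)                    # stand-in for the ViT
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
warmup_steps = 46250 // 256 + 1              # 181 batches per epoch
scheduler = torch.optim.lr_scheduler.LambdaLR(
    optimizer, lambda step: min(1.0, step / warmup_steps))

for step in range(2 * warmup_steps):         # two epochs' worth of batches
    optimizer.step()                         # weight update elided
    scheduler.step()                         # advance the LR once per batch
    # scheduler.get_last_lr()[0] climbs from ~0 to 0.1, then stays at 0.1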
Training Epoch: 2 [256/46250]	Loss: 0.2210	LR: 0.100000
Training Epoch: 2 [512/46250]	Loss: 0.1367	LR: 0.100000
Training Epoch: 2 [768/46250]	Loss: 0.3448	LR: 0.100000
Training Epoch: 2 [1024/46250]	Loss: 0.2282	LR: 0.100000
Training Epoch: 2 [1280/46250]	Loss: 0.2594	LR: 0.100000
Training Epoch: 2 [1536/46250]	Loss: 0.2942	LR: 0.100000
Training Epoch: 2 [1792/46250]	Loss: 0.2930	LR: 0.100000
Training Epoch: 2 [2048/46250]	Loss: 0.3122	LR: 0.100000
Training Epoch: 2 [2304/46250]	Loss: 0.2782	LR: 0.100000
Training Epoch: 2 [2560/46250]	Loss: 0.1701	LR: 0.100000
Training Epoch: 2 [2816/46250]	Loss: 0.3805	LR: 0.100000
Training Epoch: 2 [3072/46250]	Loss: 0.2231	LR: 0.100000
Training Epoch: 2 [3328/46250]	Loss: 0.2371	LR: 0.100000
Training Epoch: 2 [3584/46250]	Loss: 0.2844	LR: 0.100000
Training Epoch: 2 [3840/46250]	Loss: 0.2347	LR: 0.100000
Training Epoch: 2 [4096/46250]	Loss: 0.2825	LR: 0.100000
Training Epoch: 2 [4352/46250]	Loss: 0.2772	LR: 0.100000
Training Epoch: 2 [4608/46250]	Loss: 0.1912	LR: 0.100000
Training Epoch: 2 [4864/46250]	Loss: 0.2430	LR: 0.100000
Training Epoch: 2 [5120/46250]	Loss: 0.1658	LR: 0.100000
Training Epoch: 2 [5376/46250]	Loss: 0.2970	LR: 0.100000
Training Epoch: 2 [5632/46250]	Loss: 0.2977	LR: 0.100000
Training Epoch: 2 [5888/46250]	Loss: 0.2124	LR: 0.100000
Training Epoch: 2 [6144/46250]	Loss: 0.2393	LR: 0.100000
Training Epoch: 2 [6400/46250]	Loss: 0.2807	LR: 0.100000
Training Epoch: 2 [6656/46250]	Loss: 0.3072	LR: 0.100000
Training Epoch: 2 [6912/46250]	Loss: 0.2515	LR: 0.100000
Training Epoch: 2 [7168/46250]	Loss: 0.2057	LR: 0.100000
Training Epoch: 2 [7424/46250]	Loss: 0.2716	LR: 0.100000
Training Epoch: 2 [7680/46250]	Loss: 0.1796	LR: 0.100000
Training Epoch: 2 [7936/46250]	Loss: 0.2464	LR: 0.100000
Training Epoch: 2 [8192/46250]	Loss: 0.2105	LR: 0.100000
Training Epoch: 2 [8448/46250]	Loss: 0.2159	LR: 0.100000
Training Epoch: 2 [8704/46250]	Loss: 0.1822	LR: 0.100000
Training Epoch: 2 [8960/46250]	Loss: 0.2826	LR: 0.100000
Training Epoch: 2 [9216/46250]	Loss: 0.2623	LR: 0.100000
Training Epoch: 2 [9472/46250]	Loss: 0.1749	LR: 0.100000
Training Epoch: 2 [9728/46250]	Loss: 0.3498	LR: 0.100000
Training Epoch: 2 [9984/46250]	Loss: 0.2472	LR: 0.100000
Training Epoch: 2 [10240/46250]	Loss: 0.2130	LR: 0.100000
Training Epoch: 2 [10496/46250]	Loss: 0.1898	LR: 0.100000
Training Epoch: 2 [10752/46250]	Loss: 0.1865	LR: 0.100000
Training Epoch: 2 [11008/46250]	Loss: 0.2528	LR: 0.100000
Training Epoch: 2 [11264/46250]	Loss: 0.2048	LR: 0.100000
Training Epoch: 2 [11520/46250]	Loss: 0.1566	LR: 0.100000
Training Epoch: 2 [11776/46250]	Loss: 0.2994	LR: 0.100000
Training Epoch: 2 [12032/46250]	Loss: 0.2533	LR: 0.100000
Training Epoch: 2 [12288/46250]	Loss: 0.2836	LR: 0.100000
Training Epoch: 2 [12544/46250]	Loss: 0.2292	LR: 0.100000
Training Epoch: 2 [12800/46250]	Loss: 0.2149	LR: 0.100000
Training Epoch: 2 [13056/46250]	Loss: 0.2040	LR: 0.100000
Training Epoch: 2 [13312/46250]	Loss: 0.1499	LR: 0.100000
Training Epoch: 2 [13568/46250]	Loss: 0.1617	LR: 0.100000
Training Epoch: 2 [13824/46250]	Loss: 0.2055	LR: 0.100000
Training Epoch: 2 [14080/46250]	Loss: 0.2429	LR: 0.100000
Training Epoch: 2 [14336/46250]	Loss: 0.3255	LR: 0.100000
Training Epoch: 2 [14592/46250]	Loss: 0.1855	LR: 0.100000
Training Epoch: 2 [14848/46250]	Loss: 0.1860	LR: 0.100000
Training Epoch: 2 [15104/46250]	Loss: 0.1944	LR: 0.100000
Training Epoch: 2 [15360/46250]	Loss: 0.3336	LR: 0.100000
Training Epoch: 2 [15616/46250]	Loss: 0.2312	LR: 0.100000
Training Epoch: 2 [15872/46250]	Loss: 0.1773	LR: 0.100000
Training Epoch: 2 [16128/46250]	Loss: 0.2138	LR: 0.100000
Training Epoch: 2 [16384/46250]	Loss: 0.2315	LR: 0.100000
Training Epoch: 2 [16640/46250]	Loss: 0.1705	LR: 0.100000
Training Epoch: 2 [16896/46250]	Loss: 0.2199	LR: 0.100000
Training Epoch: 2 [17152/46250]	Loss: 0.2116	LR: 0.100000
Training Epoch: 2 [17408/46250]	Loss: 0.1576	LR: 0.100000
Training Epoch: 2 [17664/46250]	Loss: 0.2267	LR: 0.100000
Training Epoch: 2 [17920/46250]	Loss: 0.3027	LR: 0.100000
Training Epoch: 2 [18176/46250]	Loss: 0.1692	LR: 0.100000
Training Epoch: 2 [18432/46250]	Loss: 0.1969	LR: 0.100000
Training Epoch: 2 [18688/46250]	Loss: 0.2014	LR: 0.100000
Training Epoch: 2 [18944/46250]	Loss: 0.1772	LR: 0.100000
Training Epoch: 2 [19200/46250]	Loss: 0.2755	LR: 0.100000
Training Epoch: 2 [19456/46250]	Loss: 0.1873	LR: 0.100000
Training Epoch: 2 [19712/46250]	Loss: 0.1973	LR: 0.100000
Training Epoch: 2 [19968/46250]	Loss: 0.1681	LR: 0.100000
Training Epoch: 2 [20224/46250]	Loss: 0.1166	LR: 0.100000
Training Epoch: 2 [20480/46250]	Loss: 0.4732	LR: 0.100000
Training Epoch: 2 [20736/46250]	Loss: 0.1430	LR: 0.100000
Training Epoch: 2 [20992/46250]	Loss: 0.3101	LR: 0.100000
Training Epoch: 2 [21248/46250]	Loss: 0.3554	LR: 0.100000
Training Epoch: 2 [21504/46250]	Loss: 0.2857	LR: 0.100000
Training Epoch: 2 [21760/46250]	Loss: 0.2181	LR: 0.100000
Training Epoch: 2 [22016/46250]	Loss: 0.1802	LR: 0.100000
Training Epoch: 2 [22272/46250]	Loss: 0.2647	LR: 0.100000
Training Epoch: 2 [22528/46250]	Loss: 0.3922	LR: 0.100000
Training Epoch: 2 [22784/46250]	Loss: 0.1536	LR: 0.100000
Training Epoch: 2 [23040/46250]	Loss: 0.2512	LR: 0.100000
Training Epoch: 2 [23296/46250]	Loss: 0.2912	LR: 0.100000
Training Epoch: 2 [23552/46250]	Loss: 0.3371	LR: 0.100000
Training Epoch: 2 [23808/46250]	Loss: 0.2698	LR: 0.100000
Training Epoch: 2 [24064/46250]	Loss: 0.2714	LR: 0.100000
Training Epoch: 2 [24320/46250]	Loss: 0.2396	LR: 0.100000
Training Epoch: 2 [24576/46250]	Loss: 0.2185	LR: 0.100000
Training Epoch: 2 [24832/46250]	Loss: 0.2475	LR: 0.100000
Training Epoch: 2 [25088/46250]	Loss: 0.2145	LR: 0.100000
Training Epoch: 2 [25344/46250]	Loss: 0.3746	LR: 0.100000
Training Epoch: 2 [25600/46250]	Loss: 0.2097	LR: 0.100000
Training Epoch: 2 [25856/46250]	Loss: 0.2936	LR: 0.100000
Training Epoch: 2 [26112/46250]	Loss: 0.2085	LR: 0.100000
Training Epoch: 2 [26368/46250]	Loss: 0.2056	LR: 0.100000
Training Epoch: 2 [26624/46250]	Loss: 0.2612	LR: 0.100000
Training Epoch: 2 [26880/46250]	Loss: 0.3309	LR: 0.100000
Training Epoch: 2 [27136/46250]	Loss: 0.2951	LR: 0.100000
Training Epoch: 2 [27392/46250]	Loss: 0.3526	LR: 0.100000
Training Epoch: 2 [27648/46250]	Loss: 0.3885	LR: 0.100000
Training Epoch: 2 [27904/46250]	Loss: 0.3408	LR: 0.100000
Training Epoch: 2 [28160/46250]	Loss: 0.3197	LR: 0.100000
Training Epoch: 2 [28416/46250]	Loss: 0.3724	LR: 0.100000
Training Epoch: 2 [28672/46250]	Loss: 0.2361	LR: 0.100000
Training Epoch: 2 [28928/46250]	Loss: 0.2425	LR: 0.100000
Training Epoch: 2 [29184/46250]	Loss: 0.2738	LR: 0.100000
Training Epoch: 2 [29440/46250]	Loss: 0.2451	LR: 0.100000
Training Epoch: 2 [29696/46250]	Loss: 0.2055	LR: 0.100000
Training Epoch: 2 [29952/46250]	Loss: 0.1788	LR: 0.100000
Training Epoch: 2 [30208/46250]	Loss: 0.1900	LR: 0.100000
Training Epoch: 2 [30464/46250]	Loss: 0.2990	LR: 0.100000
Training Epoch: 2 [30720/46250]	Loss: 0.1882	LR: 0.100000
Training Epoch: 2 [30976/46250]	Loss: 0.1464	LR: 0.100000
Training Epoch: 2 [31232/46250]	Loss: 0.1680	LR: 0.100000
Training Epoch: 2 [31488/46250]	Loss: 0.2246	LR: 0.100000
Training Epoch: 2 [31744/46250]	Loss: 0.2108	LR: 0.100000
Training Epoch: 2 [32000/46250]	Loss: 0.2498	LR: 0.100000
Training Epoch: 2 [32256/46250]	Loss: 0.2864	LR: 0.100000
Training Epoch: 2 [32512/46250]	Loss: 0.2648	LR: 0.100000
Training Epoch: 2 [32768/46250]	Loss: 0.2138	LR: 0.100000
Training Epoch: 2 [33024/46250]	Loss: 0.2693	LR: 0.100000
Training Epoch: 2 [33280/46250]	Loss: 0.1837	LR: 0.100000
Training Epoch: 2 [33536/46250]	Loss: 0.1469	LR: 0.100000
Training Epoch: 2 [33792/46250]	Loss: 0.1981	LR: 0.100000
Training Epoch: 2 [34048/46250]	Loss: 0.2011	LR: 0.100000
Training Epoch: 2 [34304/46250]	Loss: 0.1508	LR: 0.100000
Training Epoch: 2 [34560/46250]	Loss: 0.1692	LR: 0.100000
Training Epoch: 2 [34816/46250]	Loss: 0.2510	LR: 0.100000
Training Epoch: 2 [35072/46250]	Loss: 0.1822	LR: 0.100000
Training Epoch: 2 [35328/46250]	Loss: 0.1984	LR: 0.100000
Training Epoch: 2 [35584/46250]	Loss: 0.2696	LR: 0.100000
Training Epoch: 2 [35840/46250]	Loss: 0.1747	LR: 0.100000
Training Epoch: 2 [36096/46250]	Loss: 0.3263	LR: 0.100000
Training Epoch: 2 [36352/46250]	Loss: 0.1950	LR: 0.100000
Training Epoch: 2 [36608/46250]	Loss: 0.1648	LR: 0.100000
Training Epoch: 2 [36864/46250]	Loss: 0.1505	LR: 0.100000
Training Epoch: 2 [37120/46250]	Loss: 0.1885	LR: 0.100000
Training Epoch: 2 [37376/46250]	Loss: 0.2141	LR: 0.100000
Training Epoch: 2 [37632/46250]	Loss: 0.1201	LR: 0.100000
Training Epoch: 2 [37888/46250]	Loss: 0.1355	LR: 0.100000
Training Epoch: 2 [38144/46250]	Loss: 0.1584	LR: 0.100000
Training Epoch: 2 [38400/46250]	Loss: 0.1558	LR: 0.100000
Training Epoch: 2 [38656/46250]	Loss: 0.1666	LR: 0.100000
Training Epoch: 2 [38912/46250]	Loss: 0.1666	LR: 0.100000
Training Epoch: 2 [39168/46250]	Loss: 0.2984	LR: 0.100000
Training Epoch: 2 [39424/46250]	Loss: 0.2847	LR: 0.100000
Training Epoch: 2 [39680/46250]	Loss: 0.1803	LR: 0.100000
Training Epoch: 2 [39936/46250]	Loss: 0.1761	LR: 0.100000
Training Epoch: 2 [40192/46250]	Loss: 0.2583	LR: 0.100000
Training Epoch: 2 [40448/46250]	Loss: 0.2608	LR: 0.100000
Training Epoch: 2 [40704/46250]	Loss: 0.2372	LR: 0.100000
Training Epoch: 2 [40960/46250]	Loss: 0.1393	LR: 0.100000
Training Epoch: 2 [41216/46250]	Loss: 0.2353	LR: 0.100000
Training Epoch: 2 [41472/46250]	Loss: 0.1939	LR: 0.100000
Training Epoch: 2 [41728/46250]	Loss: 0.1785	LR: 0.100000
Training Epoch: 2 [41984/46250]	Loss: 0.1900	LR: 0.100000
Training Epoch: 2 [42240/46250]	Loss: 0.2135	LR: 0.100000
Training Epoch: 2 [42496/46250]	Loss: 0.2448	LR: 0.100000
Training Epoch: 2 [42752/46250]	Loss: 0.1918	LR: 0.100000
Training Epoch: 2 [43008/46250]	Loss: 0.1861	LR: 0.100000
Training Epoch: 2 [43264/46250]	Loss: 0.1606	LR: 0.100000
Training Epoch: 2 [43520/46250]	Loss: 0.1175	LR: 0.100000
Training Epoch: 2 [43776/46250]	Loss: 0.1847	LR: 0.100000
Training Epoch: 2 [44032/46250]	Loss: 0.2055	LR: 0.100000
Training Epoch: 2 [44288/46250]	Loss: 0.1622	LR: 0.100000
Training Epoch: 2 [44544/46250]	Loss: 0.2128	LR: 0.100000
Training Epoch: 2 [44800/46250]	Loss: 0.2187	LR: 0.100000
Training Epoch: 2 [45056/46250]	Loss: 0.1126	LR: 0.100000
Training Epoch: 2 [45312/46250]	Loss: 0.1697	LR: 0.100000
Training Epoch: 2 [45568/46250]	Loss: 0.1468	LR: 0.100000
Training Epoch: 2 [45824/46250]	Loss: 0.1765	LR: 0.100000
Training Epoch: 2 [46080/46250]	Loss: 0.1297	LR: 0.100000
Training Epoch: 2 [46250/46250]	Loss: 0.1327	LR: 0.100000
Epoch 2 - Average Train Loss: 0.2284, Train Accuracy: 0.9240
Epoch 2 training time consumed: 334.26s
Evaluating Network.....
Test set: Epoch: 2, Average loss: 0.0005, Accuracy: 0.9594, Time consumed: 23.58s
Saving weights file to checkpoint/retrain/ViT/Friday_18_July_2025_00h_08m_56s/ViT-Cifar10-seed5-ret25-2-best.pth
Training Epoch: 3 [256/46250]	Loss: 0.2027	LR: 0.100000
Training Epoch: 3 [512/46250]	Loss: 0.1496	LR: 0.100000
Training Epoch: 3 [768/46250]	Loss: 0.1867	LR: 0.100000
Training Epoch: 3 [1024/46250]	Loss: 0.2003	LR: 0.100000
Training Epoch: 3 [1280/46250]	Loss: 0.2100	LR: 0.100000
Training Epoch: 3 [1536/46250]	Loss: 0.1542	LR: 0.100000
Training Epoch: 3 [1792/46250]	Loss: 0.1323	LR: 0.100000
Training Epoch: 3 [2048/46250]	Loss: 0.1683	LR: 0.100000
Training Epoch: 3 [2304/46250]	Loss: 0.1546	LR: 0.100000
Training Epoch: 3 [2560/46250]	Loss: 0.1516	LR: 0.100000
Training Epoch: 3 [2816/46250]	Loss: 0.1552	LR: 0.100000
Training Epoch: 3 [3072/46250]	Loss: 0.1796	LR: 0.100000
Training Epoch: 3 [3328/46250]	Loss: 0.0985	LR: 0.100000
Training Epoch: 3 [3584/46250]	Loss: 0.1287	LR: 0.100000
Training Epoch: 3 [3840/46250]	Loss: 0.1661	LR: 0.100000
Training Epoch: 3 [4096/46250]	Loss: 0.1302	LR: 0.100000
Training Epoch: 3 [4352/46250]	Loss: 0.1624	LR: 0.100000
Training Epoch: 3 [4608/46250]	Loss: 0.1773	LR: 0.100000
Training Epoch: 3 [4864/46250]	Loss: 0.1280	LR: 0.100000
Training Epoch: 3 [5120/46250]	Loss: 0.1505	LR: 0.100000
Training Epoch: 3 [5376/46250]	Loss: 0.0728	LR: 0.100000
Training Epoch: 3 [5632/46250]	Loss: 0.1186	LR: 0.100000
Training Epoch: 3 [5888/46250]	Loss: 0.2236	LR: 0.100000
Training Epoch: 3 [6144/46250]	Loss: 0.1566	LR: 0.100000
Training Epoch: 3 [6400/46250]	Loss: 0.1717	LR: 0.100000
Training Epoch: 3 [6656/46250]	Loss: 0.1142	LR: 0.100000
Training Epoch: 3 [6912/46250]	Loss: 0.2058	LR: 0.100000
Training Epoch: 3 [7168/46250]	Loss: 0.1581	LR: 0.100000
Training Epoch: 3 [7424/46250]	Loss: 0.1535	LR: 0.100000
Training Epoch: 3 [7680/46250]	Loss: 0.1826	LR: 0.100000
Training Epoch: 3 [7936/46250]	Loss: 0.1296	LR: 0.100000
Training Epoch: 3 [8192/46250]	Loss: 0.0834	LR: 0.100000
Training Epoch: 3 [8448/46250]	Loss: 0.1788	LR: 0.100000
Training Epoch: 3 [8704/46250]	Loss: 0.1571	LR: 0.100000
Training Epoch: 3 [8960/46250]	Loss: 0.0788	LR: 0.100000
Training Epoch: 3 [9216/46250]	Loss: 0.1580	LR: 0.100000
Training Epoch: 3 [9472/46250]	Loss: 0.2158	LR: 0.100000
Training Epoch: 3 [9728/46250]	Loss: 0.1385	LR: 0.100000
Training Epoch: 3 [9984/46250]	Loss: 0.1638	LR: 0.100000
Training Epoch: 3 [10240/46250]	Loss: 0.1585	LR: 0.100000
Training Epoch: 3 [10496/46250]	Loss: 0.1128	LR: 0.100000
Training Epoch: 3 [10752/46250]	Loss: 0.1474	LR: 0.100000
Training Epoch: 3 [11008/46250]	Loss: 0.0968	LR: 0.100000
Training Epoch: 3 [11264/46250]	Loss: 0.1044	LR: 0.100000
Training Epoch: 3 [11520/46250]	Loss: 0.1227	LR: 0.100000
Training Epoch: 3 [11776/46250]	Loss: 0.0935	LR: 0.100000
Training Epoch: 3 [12032/46250]	Loss: 0.1613	LR: 0.100000
Training Epoch: 3 [12288/46250]	Loss: 0.1390	LR: 0.100000
Training Epoch: 3 [12544/46250]	Loss: 0.1588	LR: 0.100000
Training Epoch: 3 [12800/46250]	Loss: 0.1751	LR: 0.100000
Training Epoch: 3 [13056/46250]	Loss: 0.1701	LR: 0.100000
Training Epoch: 3 [13312/46250]	Loss: 0.2097	LR: 0.100000
Training Epoch: 3 [13568/46250]	Loss: 0.1559	LR: 0.100000
Training Epoch: 3 [13824/46250]	Loss: 0.1782	LR: 0.100000
Training Epoch: 3 [14080/46250]	Loss: 0.1458	LR: 0.100000
Training Epoch: 3 [14336/46250]	Loss: 0.2168	LR: 0.100000
Training Epoch: 3 [14592/46250]	Loss: 0.1516	LR: 0.100000
Training Epoch: 3 [14848/46250]	Loss: 0.1893	LR: 0.100000
Training Epoch: 3 [15104/46250]	Loss: 0.1088	LR: 0.100000
Training Epoch: 3 [15360/46250]	Loss: 0.0961	LR: 0.100000
Training Epoch: 3 [15616/46250]	Loss: 0.1431	LR: 0.100000
Training Epoch: 3 [15872/46250]	Loss: 0.1795	LR: 0.100000
Training Epoch: 3 [16128/46250]	Loss: 0.0998	LR: 0.100000
Training Epoch: 3 [16384/46250]	Loss: 0.1093	LR: 0.100000
Training Epoch: 3 [16640/46250]	Loss: 0.1280	LR: 0.100000
Training Epoch: 3 [16896/46250]	Loss: 0.0852	LR: 0.100000
Training Epoch: 3 [17152/46250]	Loss: 0.1090	LR: 0.100000
Training Epoch: 3 [17408/46250]	Loss: 0.1880	LR: 0.100000
Training Epoch: 3 [17664/46250]	Loss: 0.1417	LR: 0.100000
Training Epoch: 3 [17920/46250]	Loss: 0.1031	LR: 0.100000
Training Epoch: 3 [18176/46250]	Loss: 0.1061	LR: 0.100000
Training Epoch: 3 [18432/46250]	Loss: 0.1639	LR: 0.100000
Training Epoch: 3 [18688/46250]	Loss: 0.1258	LR: 0.100000
Training Epoch: 3 [18944/46250]	Loss: 0.1626	LR: 0.100000
Training Epoch: 3 [19200/46250]	Loss: 0.1640	LR: 0.100000
Training Epoch: 3 [19456/46250]	Loss: 0.1916	LR: 0.100000
Training Epoch: 3 [19712/46250]	Loss: 0.1057	LR: 0.100000
Training Epoch: 3 [19968/46250]	Loss: 0.2005	LR: 0.100000
Training Epoch: 3 [20224/46250]	Loss: 0.1544	LR: 0.100000
Training Epoch: 3 [20480/46250]	Loss: 0.2349	LR: 0.100000
Training Epoch: 3 [20736/46250]	Loss: 0.1457	LR: 0.100000
Training Epoch: 3 [20992/46250]	Loss: 0.1123	LR: 0.100000
Training Epoch: 3 [21248/46250]	Loss: 0.1522	LR: 0.100000
Training Epoch: 3 [21504/46250]	Loss: 0.1600	LR: 0.100000
Training Epoch: 3 [21760/46250]	Loss: 0.1582	LR: 0.100000
Training Epoch: 3 [22016/46250]	Loss: 0.1284	LR: 0.100000
Training Epoch: 3 [22272/46250]	Loss: 0.2626	LR: 0.100000
Training Epoch: 3 [22528/46250]	Loss: 0.1923	LR: 0.100000
Training Epoch: 3 [22784/46250]	Loss: 0.1501	LR: 0.100000
Training Epoch: 3 [23040/46250]	Loss: 0.1782	LR: 0.100000
Training Epoch: 3 [23296/46250]	Loss: 0.2096	LR: 0.100000
Training Epoch: 3 [23552/46250]	Loss: 0.1615	LR: 0.100000
Training Epoch: 3 [23808/46250]	Loss: 0.2312	LR: 0.100000
Training Epoch: 3 [24064/46250]	Loss: 0.1988	LR: 0.100000
Training Epoch: 3 [24320/46250]	Loss: 0.2342	LR: 0.100000
Training Epoch: 3 [24576/46250]	Loss: 0.1132	LR: 0.100000
Training Epoch: 3 [24832/46250]	Loss: 0.1977	LR: 0.100000
Training Epoch: 3 [25088/46250]	Loss: 0.2213	LR: 0.100000
Training Epoch: 3 [25344/46250]	Loss: 0.1689	LR: 0.100000
Training Epoch: 3 [25600/46250]	Loss: 0.1992	LR: 0.100000
Training Epoch: 3 [25856/46250]	Loss: 0.1321	LR: 0.100000
Training Epoch: 3 [26112/46250]	Loss: 0.1354	LR: 0.100000
Training Epoch: 3 [26368/46250]	Loss: 0.1188	LR: 0.100000
Training Epoch: 3 [26624/46250]	Loss: 0.0677	LR: 0.100000
Training Epoch: 3 [26880/46250]	Loss: 0.1440	LR: 0.100000
Training Epoch: 3 [27136/46250]	Loss: 0.1875	LR: 0.100000
Training Epoch: 3 [27392/46250]	Loss: 0.1828	LR: 0.100000
Training Epoch: 3 [27648/46250]	Loss: 0.2014	LR: 0.100000
Training Epoch: 3 [27904/46250]	Loss: 0.1035	LR: 0.100000
Training Epoch: 3 [28160/46250]	Loss: 0.1696	LR: 0.100000
Training Epoch: 3 [28416/46250]	Loss: 0.1758	LR: 0.100000
Training Epoch: 3 [28672/46250]	Loss: 0.1624	LR: 0.100000
Training Epoch: 3 [28928/46250]	Loss: 0.0870	LR: 0.100000
Training Epoch: 3 [29184/46250]	Loss: 0.1053	LR: 0.100000
Training Epoch: 3 [29440/46250]	Loss: 0.1409	LR: 0.100000
Training Epoch: 3 [29696/46250]	Loss: 0.1688	LR: 0.100000
Training Epoch: 3 [29952/46250]	Loss: 0.2249	LR: 0.100000
Training Epoch: 3 [30208/46250]	Loss: 0.2189	LR: 0.100000
Training Epoch: 3 [30464/46250]	Loss: 0.1308	LR: 0.100000
Training Epoch: 3 [30720/46250]	Loss: 0.0810	LR: 0.100000
Training Epoch: 3 [30976/46250]	Loss: 0.1774	LR: 0.100000
Training Epoch: 3 [31232/46250]	Loss: 0.2417	LR: 0.100000
Training Epoch: 3 [31488/46250]	Loss: 0.1495	LR: 0.100000
Training Epoch: 3 [31744/46250]	Loss: 0.1564	LR: 0.100000
Training Epoch: 3 [32000/46250]	Loss: 0.1581	LR: 0.100000
Training Epoch: 3 [32256/46250]	Loss: 0.1315	LR: 0.100000
Training Epoch: 3 [32512/46250]	Loss: 0.1813	LR: 0.100000
Training Epoch: 3 [32768/46250]	Loss: 0.1554	LR: 0.100000
Training Epoch: 3 [33024/46250]	Loss: 0.1277	LR: 0.100000
Training Epoch: 3 [33280/46250]	Loss: 0.1354	LR: 0.100000
Training Epoch: 3 [33536/46250]	Loss: 0.1755	LR: 0.100000
Training Epoch: 3 [33792/46250]	Loss: 0.1231	LR: 0.100000
Training Epoch: 3 [34048/46250]	Loss: 0.1237	LR: 0.100000
Training Epoch: 3 [34304/46250]	Loss: 0.1095	LR: 0.100000
Training Epoch: 3 [34560/46250]	Loss: 0.1188	LR: 0.100000
Training Epoch: 3 [34816/46250]	Loss: 0.0997	LR: 0.100000
Training Epoch: 3 [35072/46250]	Loss: 0.1386	LR: 0.100000
Training Epoch: 3 [35328/46250]	Loss: 0.1943	LR: 0.100000
Training Epoch: 3 [35584/46250]	Loss: 0.1279	LR: 0.100000
Training Epoch: 3 [35840/46250]	Loss: 0.2070	LR: 0.100000
Training Epoch: 3 [36096/46250]	Loss: 0.1021	LR: 0.100000
Training Epoch: 3 [36352/46250]	Loss: 0.1628	LR: 0.100000
Training Epoch: 3 [36608/46250]	Loss: 0.1348	LR: 0.100000
Training Epoch: 3 [36864/46250]	Loss: 0.1486	LR: 0.100000
Training Epoch: 3 [37120/46250]	Loss: 0.1557	LR: 0.100000
Training Epoch: 3 [37376/46250]	Loss: 0.1122	LR: 0.100000
Training Epoch: 3 [37632/46250]	Loss: 0.2496	LR: 0.100000
Training Epoch: 3 [37888/46250]	Loss: 0.1230	LR: 0.100000
Training Epoch: 3 [38144/46250]	Loss: 0.1904	LR: 0.100000
Training Epoch: 3 [38400/46250]	Loss: 0.2134	LR: 0.100000
Training Epoch: 3 [38656/46250]	Loss: 0.1659	LR: 0.100000
Training Epoch: 3 [38912/46250]	Loss: 0.1836	LR: 0.100000
Training Epoch: 3 [39168/46250]	Loss: 0.2292	LR: 0.100000
Training Epoch: 3 [39424/46250]	Loss: 0.1773	LR: 0.100000
Training Epoch: 3 [39680/46250]	Loss: 0.2174	LR: 0.100000
Training Epoch: 3 [39936/46250]	Loss: 0.1684	LR: 0.100000
Training Epoch: 3 [40192/46250]	Loss: 0.1852	LR: 0.100000
Training Epoch: 3 [40448/46250]	Loss: 0.1663	LR: 0.100000
Training Epoch: 3 [40704/46250]	Loss: 0.1301	LR: 0.100000
Training Epoch: 3 [40960/46250]	Loss: 0.1654	LR: 0.100000
Training Epoch: 3 [41216/46250]	Loss: 0.1246	LR: 0.100000
Training Epoch: 3 [41472/46250]	Loss: 0.1573	LR: 0.100000
Training Epoch: 3 [41728/46250]	Loss: 0.1362	LR: 0.100000
Training Epoch: 3 [41984/46250]	Loss: 0.1507	LR: 0.100000
Training Epoch: 3 [42240/46250]	Loss: 0.1758	LR: 0.100000
Training Epoch: 3 [42496/46250]	Loss: 0.1802	LR: 0.100000
Training Epoch: 3 [42752/46250]	Loss: 0.1741	LR: 0.100000
Training Epoch: 3 [43008/46250]	Loss: 0.1445	LR: 0.100000
Training Epoch: 3 [43264/46250]	Loss: 0.1289	LR: 0.100000
Training Epoch: 3 [43520/46250]	Loss: 0.1642	LR: 0.100000
Training Epoch: 3 [43776/46250]	Loss: 0.2019	LR: 0.100000
Training Epoch: 3 [44032/46250]	Loss: 0.1653	LR: 0.100000
Training Epoch: 3 [44288/46250]	Loss: 0.1021	LR: 0.100000
Training Epoch: 3 [44544/46250]	Loss: 0.2466	LR: 0.100000
Training Epoch: 3 [44800/46250]	Loss: 0.2234	LR: 0.100000
Training Epoch: 3 [45056/46250]	Loss: 0.1709	LR: 0.100000
Training Epoch: 3 [45312/46250]	Loss: 0.2301	LR: 0.100000
Training Epoch: 3 [45568/46250]	Loss: 0.1843	LR: 0.100000
Training Epoch: 3 [45824/46250]	Loss: 0.2348	LR: 0.100000
Training Epoch: 3 [46080/46250]	Loss: 0.1767	LR: 0.100000
Training Epoch: 3 [46250/46250]	Loss: 0.1398	LR: 0.100000
Epoch 3 - Average Train Loss: 0.1579, Train Accuracy: 0.9467
Epoch 3 training time consumed: 334.52s
Evaluating Network.....
Test set: Epoch: 3, Average loss: 0.0006, Accuracy: 0.9495, Time consumed: 23.58s
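
No weights file is saved after epoch 3: test accuracy (0.9495) sits below the running best of 0.9594 from epoch 2, and the "-best" suffix on the earlier checkpoints suggests the script saves only on improvement. A hedged sketch of that pattern (train_one_epoch, evaluate, and run_id are assumed names, not confirmed from the source):

import torch

best_acc = 0.0
for epoch in range(1, num_epochs + 1):
    train_one_epoch(model, retain_loader, optimizer, scheduler)  # assumed helper
    acc = evaluate(model, test_loader)                           # assumed helper
    if acc > best_acc:   # save only when test accuracy improves
        best_acc = acc
        path = (f"checkpoint/retrain/ViT/{run_id}/"
                f"ViT-Cifar10-seed5-ret25-{epoch}-best.pth")
        torch.save(model.state_dict(), path)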
Training Epoch: 4 [256/46250]	Loss: 0.1590	LR: 0.100000
Training Epoch: 4 [512/46250]	Loss: 0.0853	LR: 0.100000
Training Epoch: 4 [768/46250]	Loss: 0.1975	LR: 0.100000
Training Epoch: 4 [1024/46250]	Loss: 0.1599	LR: 0.100000
Training Epoch: 4 [1280/46250]	Loss: 0.1272	LR: 0.100000
Training Epoch: 4 [1536/46250]	Loss: 0.1439	LR: 0.100000
Training Epoch: 4 [1792/46250]	Loss: 0.1756	LR: 0.100000
Training Epoch: 4 [2048/46250]	Loss: 0.1142	LR: 0.100000
Training Epoch: 4 [2304/46250]	Loss: 0.1197	LR: 0.100000
Training Epoch: 4 [2560/46250]	Loss: 0.2248	LR: 0.100000
Training Epoch: 4 [2816/46250]	Loss: 0.1800	LR: 0.100000
Training Epoch: 4 [3072/46250]	Loss: 0.1398	LR: 0.100000
Training Epoch: 4 [3328/46250]	Loss: 0.1346	LR: 0.100000
Training Epoch: 4 [3584/46250]	Loss: 0.1377	LR: 0.100000
Training Epoch: 4 [3840/46250]	Loss: 0.1512	LR: 0.100000
Training Epoch: 4 [4096/46250]	Loss: 0.2081	LR: 0.100000
Training Epoch: 4 [4352/46250]	Loss: 0.1777	LR: 0.100000
Training Epoch: 4 [4608/46250]	Loss: 0.1353	LR: 0.100000
Training Epoch: 4 [4864/46250]	Loss: 0.2121	LR: 0.100000
Training Epoch: 4 [5120/46250]	Loss: 0.2170	LR: 0.100000
Training Epoch: 4 [5376/46250]	Loss: 0.1458	LR: 0.100000
Training Epoch: 4 [5632/46250]	Loss: 0.1085	LR: 0.100000
Training Epoch: 4 [5888/46250]	Loss: 0.0686	LR: 0.100000
Training Epoch: 4 [6144/46250]	Loss: 0.1789	LR: 0.100000
Training Epoch: 4 [6400/46250]	Loss: 0.1507	LR: 0.100000
Training Epoch: 4 [6656/46250]	Loss: 0.2267	LR: 0.100000
Training Epoch: 4 [6912/46250]	Loss: 0.1961	LR: 0.100000
Training Epoch: 4 [7168/46250]	Loss: 0.1789	LR: 0.100000
Training Epoch: 4 [7424/46250]	Loss: 0.1763	LR: 0.100000
Training Epoch: 4 [7680/46250]	Loss: 0.2810	LR: 0.100000
Training Epoch: 4 [7936/46250]	Loss: 0.1708	LR: 0.100000
Training Epoch: 4 [8192/46250]	Loss: 0.1519	LR: 0.100000
Training Epoch: 4 [8448/46250]	Loss: 0.1623	LR: 0.100000
Training Epoch: 4 [8704/46250]	Loss: 0.2125	LR: 0.100000
Training Epoch: 4 [8960/46250]	Loss: 0.1208	LR: 0.100000
Training Epoch: 4 [9216/46250]	Loss: 0.1536	LR: 0.100000
Training Epoch: 4 [9472/46250]	Loss: 0.1896	LR: 0.100000
Training Epoch: 4 [9728/46250]	Loss: 0.1863	LR: 0.100000
Training Epoch: 4 [9984/46250]	Loss: 0.1390	LR: 0.100000
Training Epoch: 4 [10240/46250]	Loss: 0.1591	LR: 0.100000
Training Epoch: 4 [10496/46250]	Loss: 0.1867	LR: 0.100000
Training Epoch: 4 [10752/46250]	Loss: 0.1438	LR: 0.100000
Training Epoch: 4 [11008/46250]	Loss: 0.2004	LR: 0.100000
Training Epoch: 4 [11264/46250]	Loss: 0.1453	LR: 0.100000
Training Epoch: 4 [11520/46250]	Loss: 0.2051	LR: 0.100000
Training Epoch: 4 [11776/46250]	Loss: 0.1188	LR: 0.100000
Training Epoch: 4 [12032/46250]	Loss: 0.0901	LR: 0.100000
Training Epoch: 4 [12288/46250]	Loss: 0.1749	LR: 0.100000
Training Epoch: 4 [12544/46250]	Loss: 0.1446	LR: 0.100000
Training Epoch: 4 [12800/46250]	Loss: 0.1932	LR: 0.100000
Training Epoch: 4 [13056/46250]	Loss: 0.1327	LR: 0.100000
Training Epoch: 4 [13312/46250]	Loss: 0.2152	LR: 0.100000
Training Epoch: 4 [13568/46250]	Loss: 0.1684	LR: 0.100000
Training Epoch: 4 [13824/46250]	Loss: 0.1616	LR: 0.100000
Training Epoch: 4 [14080/46250]	Loss: 0.1287	LR: 0.100000
Training Epoch: 4 [14336/46250]	Loss: 0.1287	LR: 0.100000
Training Epoch: 4 [14592/46250]	Loss: 0.0846	LR: 0.100000
Training Epoch: 4 [14848/46250]	Loss: 0.1083	LR: 0.100000
Training Epoch: 4 [15104/46250]	Loss: 0.1489	LR: 0.100000
Training Epoch: 4 [15360/46250]	Loss: 0.1482	LR: 0.100000
Training Epoch: 4 [15616/46250]	Loss: 0.1079	LR: 0.100000
Training Epoch: 4 [15872/46250]	Loss: 0.1752	LR: 0.100000
Training Epoch: 4 [16128/46250]	Loss: 0.1369	LR: 0.100000
Training Epoch: 4 [16384/46250]	Loss: 0.1482	LR: 0.100000
Training Epoch: 4 [16640/46250]	Loss: 0.0904	LR: 0.100000
Training Epoch: 4 [16896/46250]	Loss: 0.2417	LR: 0.100000
Training Epoch: 4 [17152/46250]	Loss: 0.0981	LR: 0.100000
Training Epoch: 4 [17408/46250]	Loss: 0.1560	LR: 0.100000
Training Epoch: 4 [17664/46250]	Loss: 0.1502	LR: 0.100000
Training Epoch: 4 [17920/46250]	Loss: 0.1708	LR: 0.100000
Training Epoch: 4 [18176/46250]	Loss: 0.1720	LR: 0.100000
Training Epoch: 4 [18432/46250]	Loss: 0.2062	LR: 0.100000
Training Epoch: 4 [18688/46250]	Loss: 0.1614	LR: 0.100000
Training Epoch: 4 [18944/46250]	Loss: 0.1785	LR: 0.100000
Training Epoch: 4 [19200/46250]	Loss: 0.2130	LR: 0.100000
Training Epoch: 4 [19456/46250]	Loss: 0.1039	LR: 0.100000
Training Epoch: 4 [19712/46250]	Loss: 0.1533	LR: 0.100000
Training Epoch: 4 [19968/46250]	Loss: 0.2713	LR: 0.100000
Training Epoch: 4 [20224/46250]	Loss: 0.1763	LR: 0.100000
Training Epoch: 4 [20480/46250]	Loss: 0.2543	LR: 0.100000
Training Epoch: 4 [20736/46250]	Loss: 0.2199	LR: 0.100000
Training Epoch: 4 [20992/46250]	Loss: 0.2390	LR: 0.100000
Training Epoch: 4 [21248/46250]	Loss: 0.1909	LR: 0.100000
Training Epoch: 4 [21504/46250]	Loss: 0.2523	LR: 0.100000
Training Epoch: 4 [21760/46250]	Loss: 0.1995	LR: 0.100000
Training Epoch: 4 [22016/46250]	Loss: 0.2922	LR: 0.100000
Training Epoch: 4 [22272/46250]	Loss: 0.2618	LR: 0.100000
Training Epoch: 4 [22528/46250]	Loss: 0.1277	LR: 0.100000
Training Epoch: 4 [22784/46250]	Loss: 0.1416	LR: 0.100000
Training Epoch: 4 [23040/46250]	Loss: 0.3013	LR: 0.100000
Training Epoch: 4 [23296/46250]	Loss: 0.1082	LR: 0.100000
Training Epoch: 4 [23552/46250]	Loss: 0.1845	LR: 0.100000
Training Epoch: 4 [23808/46250]	Loss: 0.2466	LR: 0.100000
Training Epoch: 4 [24064/46250]	Loss: 0.1673	LR: 0.100000
Training Epoch: 4 [24320/46250]	Loss: 0.2529	LR: 0.100000
Training Epoch: 4 [24576/46250]	Loss: 0.1930	LR: 0.100000
Training Epoch: 4 [24832/46250]	Loss: 0.0861	LR: 0.100000
Training Epoch: 4 [25088/46250]	Loss: 0.1695	LR: 0.100000
Training Epoch: 4 [25344/46250]	Loss: 0.2243	LR: 0.100000
Training Epoch: 4 [25600/46250]	Loss: 0.1412	LR: 0.100000
Training Epoch: 4 [25856/46250]	Loss: 0.1014	LR: 0.100000
Training Epoch: 4 [26112/46250]	Loss: 0.1385	LR: 0.100000
Training Epoch: 4 [26368/46250]	Loss: 0.1434	LR: 0.100000
Training Epoch: 4 [26624/46250]	Loss: 0.1489	LR: 0.100000
Training Epoch: 4 [26880/46250]	Loss: 0.1545	LR: 0.100000
Training Epoch: 4 [27136/46250]	Loss: 0.1285	LR: 0.100000
Training Epoch: 4 [27392/46250]	Loss: 0.1158	LR: 0.100000
Training Epoch: 4 [27648/46250]	Loss: 0.1816	LR: 0.100000
Training Epoch: 4 [27904/46250]	Loss: 0.1718	LR: 0.100000
Training Epoch: 4 [28160/46250]	Loss: 0.1389	LR: 0.100000
Training Epoch: 4 [28416/46250]	Loss: 0.2382	LR: 0.100000
Training Epoch: 4 [28672/46250]	Loss: 0.1794	LR: 0.100000
Training Epoch: 4 [28928/46250]	Loss: 0.1429	LR: 0.100000
Training Epoch: 4 [29184/46250]	Loss: 0.1488	LR: 0.100000
Training Epoch: 4 [29440/46250]	Loss: 0.1992	LR: 0.100000
Training Epoch: 4 [29696/46250]	Loss: 0.1778	LR: 0.100000
Training Epoch: 4 [29952/46250]	Loss: 0.1889	LR: 0.100000
Training Epoch: 4 [30208/46250]	Loss: 0.1613	LR: 0.100000
Training Epoch: 4 [30464/46250]	Loss: 0.1676	LR: 0.100000
Training Epoch: 4 [30720/46250]	Loss: 0.2294	LR: 0.100000
Training Epoch: 4 [30976/46250]	Loss: 0.1798	LR: 0.100000
Training Epoch: 4 [31232/46250]	Loss: 0.2830	LR: 0.100000
Training Epoch: 4 [31488/46250]	Loss: 0.0957	LR: 0.100000
Training Epoch: 4 [31744/46250]	Loss: 0.0822	LR: 0.100000
Training Epoch: 4 [32000/46250]	Loss: 0.1266	LR: 0.100000
Training Epoch: 4 [32256/46250]	Loss: 0.1476	LR: 0.100000
Training Epoch: 4 [32512/46250]	Loss: 0.1932	LR: 0.100000
Training Epoch: 4 [32768/46250]	Loss: 0.1618	LR: 0.100000
Training Epoch: 4 [33024/46250]	Loss: 0.1177	LR: 0.100000
Training Epoch: 4 [33280/46250]	Loss: 0.1597	LR: 0.100000
Training Epoch: 4 [33536/46250]	Loss: 0.1886	LR: 0.100000
Training Epoch: 4 [33792/46250]	Loss: 0.1674	LR: 0.100000
Training Epoch: 4 [34048/46250]	Loss: 0.2791	LR: 0.100000
Training Epoch: 4 [34304/46250]	Loss: 0.1353	LR: 0.100000
Training Epoch: 4 [34560/46250]	Loss: 0.1594	LR: 0.100000
Training Epoch: 4 [34816/46250]	Loss: 0.1887	LR: 0.100000
Training Epoch: 4 [35072/46250]	Loss: 0.2192	LR: 0.100000
Training Epoch: 4 [35328/46250]	Loss: 0.1935	LR: 0.100000
Training Epoch: 4 [35584/46250]	Loss: 0.0986	LR: 0.100000
Training Epoch: 4 [35840/46250]	Loss: 0.1301	LR: 0.100000
Training Epoch: 4 [36096/46250]	Loss: 0.1941	LR: 0.100000
Training Epoch: 4 [36352/46250]	Loss: 0.1762	LR: 0.100000
Training Epoch: 4 [36608/46250]	Loss: 0.2080	LR: 0.100000
Training Epoch: 4 [36864/46250]	Loss: 0.1428	LR: 0.100000
Training Epoch: 4 [37120/46250]	Loss: 0.1431	LR: 0.100000
Training Epoch: 4 [37376/46250]	Loss: 0.1672	LR: 0.100000
Training Epoch: 4 [37632/46250]	Loss: 0.2073	LR: 0.100000
Training Epoch: 4 [37888/46250]	Loss: 0.1596	LR: 0.100000
Training Epoch: 4 [38144/46250]	Loss: 0.1332	LR: 0.100000
Training Epoch: 4 [38400/46250]	Loss: 0.1218	LR: 0.100000
Training Epoch: 4 [38656/46250]	Loss: 0.1468	LR: 0.100000
Training Epoch: 4 [38912/46250]	Loss: 0.2341	LR: 0.100000
Training Epoch: 4 [39168/46250]	Loss: 0.1617	LR: 0.100000
Training Epoch: 4 [39424/46250]	Loss: 0.1455	LR: 0.100000
Training Epoch: 4 [39680/46250]	Loss: 0.1183	LR: 0.100000
Training Epoch: 4 [39936/46250]	Loss: 0.1765	LR: 0.100000
Training Epoch: 4 [40192/46250]	Loss: 0.1325	LR: 0.100000
Training Epoch: 4 [40448/46250]	Loss: 0.1285	LR: 0.100000
Training Epoch: 4 [40704/46250]	Loss: 0.2177	LR: 0.100000
Training Epoch: 4 [40960/46250]	Loss: 0.1606	LR: 0.100000
Training Epoch: 4 [41216/46250]	Loss: 0.1379	LR: 0.100000
Training Epoch: 4 [41472/46250]	Loss: 0.1677	LR: 0.100000
Training Epoch: 4 [41728/46250]	Loss: 0.1470	LR: 0.100000
Training Epoch: 4 [41984/46250]	Loss: 0.1227	LR: 0.100000
Training Epoch: 4 [42240/46250]	Loss: 0.1301	LR: 0.100000
Training Epoch: 4 [42496/46250]	Loss: 0.1904	LR: 0.100000
Training Epoch: 4 [42752/46250]	Loss: 0.1450	LR: 0.100000
Training Epoch: 4 [43008/46250]	Loss: 0.1579	LR: 0.100000
Training Epoch: 4 [43264/46250]	Loss: 0.2112	LR: 0.100000
Training Epoch: 4 [43520/46250]	Loss: 0.1745	LR: 0.100000
Training Epoch: 4 [43776/46250]	Loss: 0.1918	LR: 0.100000
Training Epoch: 4 [44032/46250]	Loss: 0.2245	LR: 0.100000
Training Epoch: 4 [44288/46250]	Loss: 0.1693	LR: 0.100000
Training Epoch: 4 [44544/46250]	Loss: 0.1588	LR: 0.100000
Training Epoch: 4 [44800/46250]	Loss: 0.1913	LR: 0.100000
Training Epoch: 4 [45056/46250]	Loss: 0.2696	LR: 0.100000
Training Epoch: 4 [45312/46250]	Loss: 0.1777	LR: 0.100000
Training Epoch: 4 [45568/46250]	Loss: 0.1281	LR: 0.100000
Training Epoch: 4 [45824/46250]	Loss: 0.2549	LR: 0.100000
Training Epoch: 4 [46080/46250]	Loss: 0.1759	LR: 0.100000
Training Epoch: 4 [46250/46250]	Loss: 0.2030	LR: 0.100000
Epoch 4 - Average Train Loss: 0.1686, Train Accuracy: 0.9425
Epoch 4 training time consumed: 334.46s
Evaluating Network.....
Test set: Epoch: 4, Average loss: 0.0006, Accuracy: 0.9526, Time consumed: 23.55s
Training Epoch: 5 [256/46250]	Loss: 0.1180	LR: 0.100000
Training Epoch: 5 [512/46250]	Loss: 0.3052	LR: 0.100000
Training Epoch: 5 [768/46250]	Loss: 0.1329	LR: 0.100000
Training Epoch: 5 [1024/46250]	Loss: 0.1297	LR: 0.100000
Training Epoch: 5 [1280/46250]	Loss: 0.2512	LR: 0.100000
Training Epoch: 5 [1536/46250]	Loss: 0.1348	LR: 0.100000
Training Epoch: 5 [1792/46250]	Loss: 0.1003	LR: 0.100000
Training Epoch: 5 [2048/46250]	Loss: 0.2280	LR: 0.100000
Training Epoch: 5 [2304/46250]	Loss: 0.0762	LR: 0.100000
Training Epoch: 5 [2560/46250]	Loss: 0.1140	LR: 0.100000
Training Epoch: 5 [2816/46250]	Loss: 0.2506	LR: 0.100000
Training Epoch: 5 [3072/46250]	Loss: 0.1211	LR: 0.100000
Training Epoch: 5 [3328/46250]	Loss: 0.1979	LR: 0.100000
Training Epoch: 5 [3584/46250]	Loss: 0.1475	LR: 0.100000
Training Epoch: 5 [3840/46250]	Loss: 0.1774	LR: 0.100000
Training Epoch: 5 [4096/46250]	Loss: 0.1140	LR: 0.100000
Training Epoch: 5 [4352/46250]	Loss: 0.1651	LR: 0.100000
Training Epoch: 5 [4608/46250]	Loss: 0.1147	LR: 0.100000
Training Epoch: 5 [4864/46250]	Loss: 0.1133	LR: 0.100000
Training Epoch: 5 [5120/46250]	Loss: 0.1659	LR: 0.100000
Training Epoch: 5 [5376/46250]	Loss: 0.0619	LR: 0.100000
Training Epoch: 5 [5632/46250]	Loss: 0.1234	LR: 0.100000
Training Epoch: 5 [5888/46250]	Loss: 0.1049	LR: 0.100000
Training Epoch: 5 [6144/46250]	Loss: 0.1196	LR: 0.100000
Training Epoch: 5 [6400/46250]	Loss: 0.1395	LR: 0.100000
Training Epoch: 5 [6656/46250]	Loss: 0.2577	LR: 0.100000
Training Epoch: 5 [6912/46250]	Loss: 0.1425	LR: 0.100000
Training Epoch: 5 [7168/46250]	Loss: 0.1770	LR: 0.100000
Training Epoch: 5 [7424/46250]	Loss: 0.1485	LR: 0.100000
Training Epoch: 5 [7680/46250]	Loss: 0.1098	LR: 0.100000
Training Epoch: 5 [7936/46250]	Loss: 0.0744	LR: 0.100000
Training Epoch: 5 [8192/46250]	Loss: 0.1379	LR: 0.100000
Training Epoch: 5 [8448/46250]	Loss: 0.1577	LR: 0.100000
Training Epoch: 5 [8704/46250]	Loss: 0.1100	LR: 0.100000
Training Epoch: 5 [8960/46250]	Loss: 0.1418	LR: 0.100000
Training Epoch: 5 [9216/46250]	Loss: 0.2093	LR: 0.100000
Training Epoch: 5 [9472/46250]	Loss: 0.1707	LR: 0.100000
Training Epoch: 5 [9728/46250]	Loss: 0.2331	LR: 0.100000
Training Epoch: 5 [9984/46250]	Loss: 0.2416	LR: 0.100000
Training Epoch: 5 [10240/46250]	Loss: 0.2026	LR: 0.100000
Training Epoch: 5 [10496/46250]	Loss: 0.1638	LR: 0.100000
Training Epoch: 5 [10752/46250]	Loss: 0.1735	LR: 0.100000
Training Epoch: 5 [11008/46250]	Loss: 0.1468	LR: 0.100000
Training Epoch: 5 [11264/46250]	Loss: 0.1861	LR: 0.100000
Training Epoch: 5 [11520/46250]	Loss: 0.0936	LR: 0.100000
Training Epoch: 5 [11776/46250]	Loss: 0.1433	LR: 0.100000
Training Epoch: 5 [12032/46250]	Loss: 0.1445	LR: 0.100000
Training Epoch: 5 [12288/46250]	Loss: 0.1615	LR: 0.100000
Training Epoch: 5 [12544/46250]	Loss: 0.1136	LR: 0.100000
Training Epoch: 5 [12800/46250]	Loss: 0.1790	LR: 0.100000
Training Epoch: 5 [13056/46250]	Loss: 0.1800	LR: 0.100000
Training Epoch: 5 [13312/46250]	Loss: 0.1187	LR: 0.100000
Training Epoch: 5 [13568/46250]	Loss: 0.1195	LR: 0.100000
Training Epoch: 5 [13824/46250]	Loss: 0.2052	LR: 0.100000
Training Epoch: 5 [14080/46250]	Loss: 0.1800	LR: 0.100000
Training Epoch: 5 [14336/46250]	Loss: 0.1514	LR: 0.100000
Training Epoch: 5 [14592/46250]	Loss: 0.1250	LR: 0.100000
Training Epoch: 5 [14848/46250]	Loss: 0.1601	LR: 0.100000
Training Epoch: 5 [15104/46250]	Loss: 0.1940	LR: 0.100000
Training Epoch: 5 [15360/46250]	Loss: 0.1640	LR: 0.100000
Training Epoch: 5 [15616/46250]	Loss: 0.0881	LR: 0.100000
Training Epoch: 5 [15872/46250]	Loss: 0.1226	LR: 0.100000
Training Epoch: 5 [16128/46250]	Loss: 0.1973	LR: 0.100000
Training Epoch: 5 [16384/46250]	Loss: 0.1342	LR: 0.100000
Training Epoch: 5 [16640/46250]	Loss: 0.1332	LR: 0.100000
Training Epoch: 5 [16896/46250]	Loss: 0.2362	LR: 0.100000
Training Epoch: 5 [17152/46250]	Loss: 0.1045	LR: 0.100000
Training Epoch: 5 [17408/46250]	Loss: 0.1565	LR: 0.100000
Training Epoch: 5 [17664/46250]	Loss: 0.1470	LR: 0.100000
Training Epoch: 5 [17920/46250]	Loss: 0.1564	LR: 0.100000
Training Epoch: 5 [18176/46250]	Loss: 0.1620	LR: 0.100000
Training Epoch: 5 [18432/46250]	Loss: 0.1630	LR: 0.100000
Training Epoch: 5 [18688/46250]	Loss: 0.1688	LR: 0.100000
Training Epoch: 5 [18944/46250]	Loss: 0.1408	LR: 0.100000
Training Epoch: 5 [19200/46250]	Loss: 0.1472	LR: 0.100000
Training Epoch: 5 [19456/46250]	Loss: 0.1671	LR: 0.100000
Training Epoch: 5 [19712/46250]	Loss: 0.0961	LR: 0.100000
Training Epoch: 5 [19968/46250]	Loss: 0.1228	LR: 0.100000
Training Epoch: 5 [20224/46250]	Loss: 0.1373	LR: 0.100000
Training Epoch: 5 [20480/46250]	Loss: 0.1376	LR: 0.100000
Training Epoch: 5 [20736/46250]	Loss: 0.0929	LR: 0.100000
Training Epoch: 5 [20992/46250]	Loss: 0.1413	LR: 0.100000
Training Epoch: 5 [21248/46250]	Loss: 0.1875	LR: 0.100000
Training Epoch: 5 [21504/46250]	Loss: 0.0844	LR: 0.100000
Training Epoch: 5 [21760/46250]	Loss: 0.1025	LR: 0.100000
Training Epoch: 5 [22016/46250]	Loss: 0.2350	LR: 0.100000
Training Epoch: 5 [22272/46250]	Loss: 0.2196	LR: 0.100000
Training Epoch: 5 [22528/46250]	Loss: 0.1395	LR: 0.100000
Training Epoch: 5 [22784/46250]	Loss: 0.1478	LR: 0.100000
Training Epoch: 5 [23040/46250]	Loss: 0.1902	LR: 0.100000
Training Epoch: 5 [23296/46250]	Loss: 0.1488	LR: 0.100000
Training Epoch: 5 [23552/46250]	Loss: 0.1045	LR: 0.100000
Training Epoch: 5 [23808/46250]	Loss: 0.2254	LR: 0.100000
Training Epoch: 5 [24064/46250]	Loss: 0.2180	LR: 0.100000
Training Epoch: 5 [24320/46250]	Loss: 0.1649	LR: 0.100000
Training Epoch: 5 [24576/46250]	Loss: 0.1592	LR: 0.100000
Training Epoch: 5 [24832/46250]	Loss: 0.1266	LR: 0.100000
Training Epoch: 5 [25088/46250]	Loss: 0.1150	LR: 0.100000
Training Epoch: 5 [25344/46250]	Loss: 0.1468	LR: 0.100000
Training Epoch: 5 [25600/46250]	Loss: 0.1221	LR: 0.100000
Training Epoch: 5 [25856/46250]	Loss: 0.1846	LR: 0.100000
Training Epoch: 5 [26112/46250]	Loss: 0.1993	LR: 0.100000
Training Epoch: 5 [26368/46250]	Loss: 0.2080	LR: 0.100000
Training Epoch: 5 [26624/46250]	Loss: 0.2071	LR: 0.100000
Training Epoch: 5 [26880/46250]	Loss: 0.1169	LR: 0.100000
Training Epoch: 5 [27136/46250]	Loss: 0.1743	LR: 0.100000
Training Epoch: 5 [27392/46250]	Loss: 0.1951	LR: 0.100000
Training Epoch: 5 [27648/46250]	Loss: 0.1026	LR: 0.100000
Training Epoch: 5 [27904/46250]	Loss: 0.1884	LR: 0.100000
Training Epoch: 5 [28160/46250]	Loss: 0.1165	LR: 0.100000
Training Epoch: 5 [28416/46250]	Loss: 0.1593	LR: 0.100000
Training Epoch: 5 [28672/46250]	Loss: 0.1560	LR: 0.100000
Training Epoch: 5 [28928/46250]	Loss: 0.1649	LR: 0.100000
Training Epoch: 5 [29184/46250]	Loss: 0.2961	LR: 0.100000
Training Epoch: 5 [29440/46250]	Loss: 0.1168	LR: 0.100000
Training Epoch: 5 [29696/46250]	Loss: 0.1010	LR: 0.100000
Training Epoch: 5 [29952/46250]	Loss: 0.1396	LR: 0.100000
Training Epoch: 5 [30208/46250]	Loss: 0.1642	LR: 0.100000
Training Epoch: 5 [30464/46250]	Loss: 0.2212	LR: 0.100000
Training Epoch: 5 [30720/46250]	Loss: 0.2453	LR: 0.100000
Training Epoch: 5 [30976/46250]	Loss: 0.1395	LR: 0.100000
Training Epoch: 5 [31232/46250]	Loss: 0.1336	LR: 0.100000
Training Epoch: 5 [31488/46250]	Loss: 0.1224	LR: 0.100000
Training Epoch: 5 [31744/46250]	Loss: 0.1439	LR: 0.100000
Training Epoch: 5 [32000/46250]	Loss: 0.1031	LR: 0.100000
Training Epoch: 5 [32256/46250]	Loss: 0.1682	LR: 0.100000
Training Epoch: 5 [32512/46250]	Loss: 0.2113	LR: 0.100000
Training Epoch: 5 [32768/46250]	Loss: 0.1925	LR: 0.100000
Training Epoch: 5 [33024/46250]	Loss: 0.2249	LR: 0.100000
Training Epoch: 5 [33280/46250]	Loss: 0.2594	LR: 0.100000
Training Epoch: 5 [33536/46250]	Loss: 0.1284	LR: 0.100000
Training Epoch: 5 [33792/46250]	Loss: 0.1642	LR: 0.100000
Training Epoch: 5 [34048/46250]	Loss: 0.1906	LR: 0.100000
Training Epoch: 5 [34304/46250]	Loss: 0.1123	LR: 0.100000
Training Epoch: 5 [34560/46250]	Loss: 0.1324	LR: 0.100000
Training Epoch: 5 [34816/46250]	Loss: 0.1189	LR: 0.100000
Training Epoch: 5 [35072/46250]	Loss: 0.1334	LR: 0.100000
Training Epoch: 5 [35328/46250]	Loss: 0.2116	LR: 0.100000
Training Epoch: 5 [35584/46250]	Loss: 0.1750	LR: 0.100000
Training Epoch: 5 [35840/46250]	Loss: 0.1084	LR: 0.100000
Training Epoch: 5 [36096/46250]	Loss: 0.1564	LR: 0.100000
Training Epoch: 5 [36352/46250]	Loss: 0.1146	LR: 0.100000
Training Epoch: 5 [36608/46250]	Loss: 0.1508	LR: 0.100000
Training Epoch: 5 [36864/46250]	Loss: 0.2247	LR: 0.100000
Training Epoch: 5 [37120/46250]	Loss: 0.1682	LR: 0.100000
Training Epoch: 5 [37376/46250]	Loss: 0.1568	LR: 0.100000
Training Epoch: 5 [37632/46250]	Loss: 0.1323	LR: 0.100000
Training Epoch: 5 [37888/46250]	Loss: 0.2191	LR: 0.100000
Training Epoch: 5 [38144/46250]	Loss: 0.1402	LR: 0.100000
Training Epoch: 5 [38400/46250]	Loss: 0.1311	LR: 0.100000
Training Epoch: 5 [38656/46250]	Loss: 0.1855	LR: 0.100000
Training Epoch: 5 [38912/46250]	Loss: 0.2183	LR: 0.100000
Training Epoch: 5 [39168/46250]	Loss: 0.1532	LR: 0.100000
Training Epoch: 5 [39424/46250]	Loss: 0.1908	LR: 0.100000
Training Epoch: 5 [39680/46250]	Loss: 0.1636	LR: 0.100000
Training Epoch: 5 [39936/46250]	Loss: 0.1849	LR: 0.100000
Training Epoch: 5 [40192/46250]	Loss: 0.1240	LR: 0.100000
Training Epoch: 5 [40448/46250]	Loss: 0.1276	LR: 0.100000
Training Epoch: 5 [40704/46250]	Loss: 0.1621	LR: 0.100000
Training Epoch: 5 [40960/46250]	Loss: 0.1445	LR: 0.100000
Training Epoch: 5 [41216/46250]	Loss: 0.1450	LR: 0.100000
Training Epoch: 5 [41472/46250]	Loss: 0.1095	LR: 0.100000
Training Epoch: 5 [41728/46250]	Loss: 0.1617	LR: 0.100000
Training Epoch: 5 [41984/46250]	Loss: 0.1407	LR: 0.100000
Training Epoch: 5 [42240/46250]	Loss: 0.1289	LR: 0.100000
Training Epoch: 5 [42496/46250]	Loss: 0.1298	LR: 0.100000
Training Epoch: 5 [42752/46250]	Loss: 0.1421	LR: 0.100000
Training Epoch: 5 [43008/46250]	Loss: 0.1533	LR: 0.100000
Training Epoch: 5 [43264/46250]	Loss: 0.1992	LR: 0.100000
Training Epoch: 5 [43520/46250]	Loss: 0.1546	LR: 0.100000
Training Epoch: 5 [43776/46250]	Loss: 0.1945	LR: 0.100000
Training Epoch: 5 [44032/46250]	Loss: 0.1526	LR: 0.100000
Training Epoch: 5 [44288/46250]	Loss: 0.1577	LR: 0.100000
Training Epoch: 5 [44544/46250]	Loss: 0.2076	LR: 0.100000
Training Epoch: 5 [44800/46250]	Loss: 0.2081	LR: 0.100000
Training Epoch: 5 [45056/46250]	Loss: 0.1963	LR: 0.100000
Training Epoch: 5 [45312/46250]	Loss: 0.2171	LR: 0.100000
Training Epoch: 5 [45568/46250]	Loss: 0.2168	LR: 0.100000
Training Epoch: 5 [45824/46250]	Loss: 0.1769	LR: 0.100000
Training Epoch: 5 [46080/46250]	Loss: 0.2100	LR: 0.100000
Training Epoch: 5 [46250/46250]	Loss: 0.1582	LR: 0.100000
Epoch 5 - Average Train Loss: 0.1589, Train Accuracy: 0.9454
Epoch 5 training time consumed: 334.43s
Evaluating Network.....
Test set: Epoch: 5, Average loss: 0.0007, Accuracy: 0.9456, Time consumed: 23.56s
Training Epoch: 6 [256/46250]	Loss: 0.1462	LR: 0.100000
Training Epoch: 6 [512/46250]	Loss: 0.1697	LR: 0.100000
Training Epoch: 6 [768/46250]	Loss: 0.1034	LR: 0.100000
Training Epoch: 6 [1024/46250]	Loss: 0.1285	LR: 0.100000
Training Epoch: 6 [1280/46250]	Loss: 0.1230	LR: 0.100000
Training Epoch: 6 [1536/46250]	Loss: 0.1849	LR: 0.100000
Training Epoch: 6 [1792/46250]	Loss: 0.1598	LR: 0.100000
Training Epoch: 6 [2048/46250]	Loss: 0.1504	LR: 0.100000
Training Epoch: 6 [2304/46250]	Loss: 0.1340	LR: 0.100000
Training Epoch: 6 [2560/46250]	Loss: 0.1735	LR: 0.100000
Training Epoch: 6 [2816/46250]	Loss: 0.1434	LR: 0.100000
Training Epoch: 6 [3072/46250]	Loss: 0.2045	LR: 0.100000
Training Epoch: 6 [3328/46250]	Loss: 0.1369	LR: 0.100000
Training Epoch: 6 [3584/46250]	Loss: 0.1631	LR: 0.100000
Training Epoch: 6 [3840/46250]	Loss: 0.1308	LR: 0.100000
Training Epoch: 6 [4096/46250]	Loss: 0.2477	LR: 0.100000
Training Epoch: 6 [4352/46250]	Loss: 0.2082	LR: 0.100000
Training Epoch: 6 [4608/46250]	Loss: 0.2200	LR: 0.100000
Training Epoch: 6 [4864/46250]	Loss: 0.1982	LR: 0.100000
Training Epoch: 6 [5120/46250]	Loss: 0.1246	LR: 0.100000
Training Epoch: 6 [5376/46250]	Loss: 0.1292	LR: 0.100000
Training Epoch: 6 [5632/46250]	Loss: 0.2262	LR: 0.100000
Training Epoch: 6 [5888/46250]	Loss: 0.1386	LR: 0.100000
Training Epoch: 6 [6144/46250]	Loss: 0.1022	LR: 0.100000
Training Epoch: 6 [6400/46250]	Loss: 0.1199	LR: 0.100000
Training Epoch: 6 [6656/46250]	Loss: 0.1670	LR: 0.100000
Training Epoch: 6 [6912/46250]	Loss: 0.1100	LR: 0.100000
Training Epoch: 6 [7168/46250]	Loss: 0.1239	LR: 0.100000
Training Epoch: 6 [7424/46250]	Loss: 0.1881	LR: 0.100000
Training Epoch: 6 [7680/46250]	Loss: 0.1388	LR: 0.100000
Training Epoch: 6 [7936/46250]	Loss: 0.1111	LR: 0.100000
Training Epoch: 6 [8192/46250]	Loss: 0.1837	LR: 0.100000
Training Epoch: 6 [8448/46250]	Loss: 0.1694	LR: 0.100000
Training Epoch: 6 [8704/46250]	Loss: 0.1679	LR: 0.100000
Training Epoch: 6 [8960/46250]	Loss: 0.2335	LR: 0.100000
Training Epoch: 6 [9216/46250]	Loss: 0.1064	LR: 0.100000
Training Epoch: 6 [9472/46250]	Loss: 0.2226	LR: 0.100000
Training Epoch: 6 [9728/46250]	Loss: 0.1974	LR: 0.100000
Training Epoch: 6 [9984/46250]	Loss: 0.1790	LR: 0.100000
Training Epoch: 6 [10240/46250]	Loss: 0.1327	LR: 0.100000
Training Epoch: 6 [10496/46250]	Loss: 0.1730	LR: 0.100000
Training Epoch: 6 [10752/46250]	Loss: 0.1846	LR: 0.100000
Training Epoch: 6 [11008/46250]	Loss: 0.1669	LR: 0.100000
Training Epoch: 6 [11264/46250]	Loss: 0.1076	LR: 0.100000
Training Epoch: 6 [11520/46250]	Loss: 0.1994	LR: 0.100000
Training Epoch: 6 [11776/46250]	Loss: 0.0831	LR: 0.100000
Training Epoch: 6 [12032/46250]	Loss: 0.1842	LR: 0.100000
Training Epoch: 6 [12288/46250]	Loss: 0.1561	LR: 0.100000
Training Epoch: 6 [12544/46250]	Loss: 0.1151	LR: 0.100000
Training Epoch: 6 [12800/46250]	Loss: 0.1295	LR: 0.100000
Training Epoch: 6 [13056/46250]	Loss: 0.1480	LR: 0.100000
Training Epoch: 6 [13312/46250]	Loss: 0.1340	LR: 0.100000
Training Epoch: 6 [13568/46250]	Loss: 0.1459	LR: 0.100000
Training Epoch: 6 [13824/46250]	Loss: 0.1507	LR: 0.100000
Training Epoch: 6 [14080/46250]	Loss: 0.2248	LR: 0.100000
Training Epoch: 6 [14336/46250]	Loss: 0.1258	LR: 0.100000
Training Epoch: 6 [14592/46250]	Loss: 0.1566	LR: 0.100000
Training Epoch: 6 [14848/46250]	Loss: 0.1372	LR: 0.100000
Training Epoch: 6 [15104/46250]	Loss: 0.2677	LR: 0.100000
Training Epoch: 6 [15360/46250]	Loss: 0.1391	LR: 0.100000
Training Epoch: 6 [15616/46250]	Loss: 0.1453	LR: 0.100000
Training Epoch: 6 [15872/46250]	Loss: 0.1153	LR: 0.100000
Training Epoch: 6 [16128/46250]	Loss: 0.1072	LR: 0.100000
Training Epoch: 6 [16384/46250]	Loss: 0.1643	LR: 0.100000
Training Epoch: 6 [16640/46250]	Loss: 0.1586	LR: 0.100000
Training Epoch: 6 [16896/46250]	Loss: 0.2491	LR: 0.100000
Training Epoch: 6 [17152/46250]	Loss: 0.2031	LR: 0.100000
Training Epoch: 6 [17408/46250]	Loss: 0.2026	LR: 0.100000
Training Epoch: 6 [17664/46250]	Loss: 0.1920	LR: 0.100000
Training Epoch: 6 [17920/46250]	Loss: 0.1389	LR: 0.100000
Training Epoch: 6 [18176/46250]	Loss: 0.1746	LR: 0.100000
Training Epoch: 6 [18432/46250]	Loss: 0.2225	LR: 0.100000
Training Epoch: 6 [18688/46250]	Loss: 0.1344	LR: 0.100000
Training Epoch: 6 [18944/46250]	Loss: 0.1799	LR: 0.100000
Training Epoch: 6 [19200/46250]	Loss: 0.2320	LR: 0.100000
Training Epoch: 6 [19456/46250]	Loss: 0.1888	LR: 0.100000
Training Epoch: 6 [19712/46250]	Loss: 0.2100	LR: 0.100000
Training Epoch: 6 [19968/46250]	Loss: 0.1705	LR: 0.100000
Training Epoch: 6 [20224/46250]	Loss: 0.1706	LR: 0.100000
Training Epoch: 6 [20480/46250]	Loss: 0.1033	LR: 0.100000
Training Epoch: 6 [20736/46250]	Loss: 0.1839	LR: 0.100000
Training Epoch: 6 [20992/46250]	Loss: 0.1532	LR: 0.100000
Training Epoch: 6 [21248/46250]	Loss: 0.1911	LR: 0.100000
Training Epoch: 6 [21504/46250]	Loss: 0.1687	LR: 0.100000
Training Epoch: 6 [21760/46250]	Loss: 0.1958	LR: 0.100000
Training Epoch: 6 [22016/46250]	Loss: 0.1679	LR: 0.100000
Training Epoch: 6 [22272/46250]	Loss: 0.2034	LR: 0.100000
Training Epoch: 6 [22528/46250]	Loss: 0.1071	LR: 0.100000
Training Epoch: 6 [22784/46250]	Loss: 0.1881	LR: 0.100000
Training Epoch: 6 [23040/46250]	Loss: 0.2072	LR: 0.100000
Training Epoch: 6 [23296/46250]	Loss: 0.1830	LR: 0.100000
Training Epoch: 6 [23552/46250]	Loss: 0.1723	LR: 0.100000
Training Epoch: 6 [23808/46250]	Loss: 0.2148	LR: 0.100000
Training Epoch: 6 [24064/46250]	Loss: 0.1813	LR: 0.100000
Training Epoch: 6 [24320/46250]	Loss: 0.1890	LR: 0.100000
Training Epoch: 6 [24576/46250]	Loss: 0.2439	LR: 0.100000
Training Epoch: 6 [24832/46250]	Loss: 0.1282	LR: 0.100000
Training Epoch: 6 [25088/46250]	Loss: 0.1661	LR: 0.100000
Training Epoch: 6 [25344/46250]	Loss: 0.2220	LR: 0.100000
Training Epoch: 6 [25600/46250]	Loss: 0.2665	LR: 0.100000
Training Epoch: 6 [25856/46250]	Loss: 0.1463	LR: 0.100000
Training Epoch: 6 [26112/46250]	Loss: 0.1605	LR: 0.100000
Training Epoch: 6 [26368/46250]	Loss: 0.2765	LR: 0.100000
Training Epoch: 6 [26624/46250]	Loss: 0.1643	LR: 0.100000
Training Epoch: 6 [26880/46250]	Loss: 0.3068	LR: 0.100000
Training Epoch: 6 [27136/46250]	Loss: 0.1697	LR: 0.100000
Training Epoch: 6 [27392/46250]	Loss: 0.1812	LR: 0.100000
Training Epoch: 6 [27648/46250]	Loss: 0.2585	LR: 0.100000
Training Epoch: 6 [27904/46250]	Loss: 0.1755	LR: 0.100000
Training Epoch: 6 [28160/46250]	Loss: 0.1721	LR: 0.100000
Training Epoch: 6 [28416/46250]	Loss: 0.1090	LR: 0.100000
Training Epoch: 6 [28672/46250]	Loss: 0.2497	LR: 0.100000
Training Epoch: 6 [28928/46250]	Loss: 0.1706	LR: 0.100000
Training Epoch: 6 [29184/46250]	Loss: 0.1791	LR: 0.100000
Training Epoch: 6 [29440/46250]	Loss: 0.1606	LR: 0.100000
Training Epoch: 6 [29696/46250]	Loss: 0.1499	LR: 0.100000
Training Epoch: 6 [29952/46250]	Loss: 0.2444	LR: 0.100000
Training Epoch: 6 [30208/46250]	Loss: 0.2113	LR: 0.100000
Training Epoch: 6 [30464/46250]	Loss: 0.2808	LR: 0.100000
Training Epoch: 6 [30720/46250]	Loss: 0.2113	LR: 0.100000
Training Epoch: 6 [30976/46250]	Loss: 0.2458	LR: 0.100000
Training Epoch: 6 [31232/46250]	Loss: 0.1077	LR: 0.100000
Training Epoch: 6 [31488/46250]	Loss: 0.3167	LR: 0.100000
Training Epoch: 6 [31744/46250]	Loss: 0.3125	LR: 0.100000
Training Epoch: 6 [32000/46250]	Loss: 0.1599	LR: 0.100000
Training Epoch: 6 [32256/46250]	Loss: 0.2002	LR: 0.100000
Training Epoch: 6 [32512/46250]	Loss: 0.2400	LR: 0.100000
Training Epoch: 6 [32768/46250]	Loss: 0.2063	LR: 0.100000
Training Epoch: 6 [33024/46250]	Loss: 0.2125	LR: 0.100000
Training Epoch: 6 [33280/46250]	Loss: 0.2076	LR: 0.100000
Training Epoch: 6 [33536/46250]	Loss: 0.1715	LR: 0.100000
Training Epoch: 6 [33792/46250]	Loss: 0.1864	LR: 0.100000
Training Epoch: 6 [34048/46250]	Loss: 0.1253	LR: 0.100000
Training Epoch: 6 [34304/46250]	Loss: 0.2441	LR: 0.100000
Training Epoch: 6 [34560/46250]	Loss: 0.0967	LR: 0.100000
Training Epoch: 6 [34816/46250]	Loss: 0.0758	LR: 0.100000
Training Epoch: 6 [35072/46250]	Loss: 0.2138	LR: 0.100000
Training Epoch: 6 [35328/46250]	Loss: 0.1012	LR: 0.100000
Training Epoch: 6 [35584/46250]	Loss: 0.1428	LR: 0.100000
Training Epoch: 6 [35840/46250]	Loss: 0.1929	LR: 0.100000
Training Epoch: 6 [36096/46250]	Loss: 0.1749	LR: 0.100000
Training Epoch: 6 [36352/46250]	Loss: 0.2931	LR: 0.100000
Training Epoch: 6 [36608/46250]	Loss: 0.2327	LR: 0.100000
Training Epoch: 6 [36864/46250]	Loss: 0.2014	LR: 0.100000
Training Epoch: 6 [37120/46250]	Loss: 0.2378	LR: 0.100000
Training Epoch: 6 [37376/46250]	Loss: 0.1640	LR: 0.100000
Training Epoch: 6 [37632/46250]	Loss: 0.2699	LR: 0.100000
Training Epoch: 6 [37888/46250]	Loss: 0.2211	LR: 0.100000
Training Epoch: 6 [38144/46250]	Loss: 0.2074	LR: 0.100000
Training Epoch: 6 [38400/46250]	Loss: 0.2029	LR: 0.100000
Training Epoch: 6 [38656/46250]	Loss: 0.2110	LR: 0.100000
Training Epoch: 6 [38912/46250]	Loss: 0.1178	LR: 0.100000
Training Epoch: 6 [39168/46250]	Loss: 0.2018	LR: 0.100000
Training Epoch: 6 [39424/46250]	Loss: 0.2118	LR: 0.100000
Training Epoch: 6 [39680/46250]	Loss: 0.2060	LR: 0.100000
Training Epoch: 6 [39936/46250]	Loss: 0.2299	LR: 0.100000
Training Epoch: 6 [40192/46250]	Loss: 0.1549	LR: 0.100000
Training Epoch: 6 [40448/46250]	Loss: 0.2154	LR: 0.100000
Training Epoch: 6 [40704/46250]	Loss: 0.2163	LR: 0.100000
Training Epoch: 6 [40960/46250]	Loss: 0.1407	LR: 0.100000
Training Epoch: 6 [41216/46250]	Loss: 0.2737	LR: 0.100000
Training Epoch: 6 [41472/46250]	Loss: 0.1606	LR: 0.100000
Training Epoch: 6 [41728/46250]	Loss: 0.2586	LR: 0.100000
Training Epoch: 6 [41984/46250]	Loss: 0.2526	LR: 0.100000
Training Epoch: 6 [42240/46250]	Loss: 0.2314	LR: 0.100000
Training Epoch: 6 [42496/46250]	Loss: 0.2575	LR: 0.100000
Training Epoch: 6 [42752/46250]	Loss: 0.1839	LR: 0.100000
Training Epoch: 6 [43008/46250]	Loss: 0.1892	LR: 0.100000
Training Epoch: 6 [43264/46250]	Loss: 0.1430	LR: 0.100000
Training Epoch: 6 [43520/46250]	Loss: 0.2142	LR: 0.100000
Training Epoch: 6 [43776/46250]	Loss: 0.1637	LR: 0.100000
Training Epoch: 6 [44032/46250]	Loss: 0.1559	LR: 0.100000
Training Epoch: 6 [44288/46250]	Loss: 0.1987	LR: 0.100000
Training Epoch: 6 [44544/46250]	Loss: 0.2432	LR: 0.100000
Training Epoch: 6 [44800/46250]	Loss: 0.1430	LR: 0.100000
Training Epoch: 6 [45056/46250]	Loss: 0.2096	LR: 0.100000
Training Epoch: 6 [45312/46250]	Loss: 0.1786	LR: 0.100000
Training Epoch: 6 [45568/46250]	Loss: 0.1856	LR: 0.100000
Training Epoch: 6 [45824/46250]	Loss: 0.2221	LR: 0.100000
Training Epoch: 6 [46080/46250]	Loss: 0.1387	LR: 0.100000
Training Epoch: 6 [46250/46250]	Loss: 0.2664	LR: 0.100000
Epoch 6 - Average Train Loss: 0.1813, Train Accuracy: 0.9370
Epoch 6 training time consumed: 334.63s
Evaluating Network.....
Test set: Epoch: 6, Average loss: 0.0007, Accuracy: 0.9417, Time consumed: 23.53s
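The LR column is flat at 0.100000 through epoch 6 and drops to 0.020000 from epoch 7 on, a 5x step decay. A sketch assuming a MultiStepLR schedule with gamma=0.2 stepped once per epoch; the milestone and gamma are inferred from the printed values, and the per-iteration warmup visible in epoch 1 would run before this scheduler takes over:

```python
import torch

model = torch.nn.Linear(8, 8)        # stand-in module; the real model is a ViT
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

# Stepping after each epoch, a milestone of 6 makes epochs 7+ run at
# 0.1 * 0.2 = 0.02, matching the LR column in the log (an inference).
scheduler = torch.optim.lr_scheduler.MultiStepLR(optimizer, milestones=[6], gamma=0.2)

for epoch in range(1, 9):
    print(f"epoch {epoch}: LR = {optimizer.param_groups[0]['lr']:.6f}")
    # train_one_epoch(model, retain_loader, optimizer, epoch)  # as sketched above
    scheduler.step()
```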
Training Epoch: 7 [256/46250]	Loss: 0.2244	LR: 0.020000
Training Epoch: 7 [512/46250]	Loss: 0.2130	LR: 0.020000
Training Epoch: 7 [768/46250]	Loss: 0.1199	LR: 0.020000
Training Epoch: 7 [1024/46250]	Loss: 0.1489	LR: 0.020000
Training Epoch: 7 [1280/46250]	Loss: 0.0413	LR: 0.020000
Training Epoch: 7 [1536/46250]	Loss: 0.0802	LR: 0.020000
Training Epoch: 7 [1792/46250]	Loss: 0.1132	LR: 0.020000
Training Epoch: 7 [2048/46250]	Loss: 0.0606	LR: 0.020000
Training Epoch: 7 [2304/46250]	Loss: 0.1131	LR: 0.020000
Training Epoch: 7 [2560/46250]	Loss: 0.1202	LR: 0.020000
Training Epoch: 7 [2816/46250]	Loss: 0.1595	LR: 0.020000
Training Epoch: 7 [3072/46250]	Loss: 0.0710	LR: 0.020000
Training Epoch: 7 [3328/46250]	Loss: 0.0680	LR: 0.020000
Training Epoch: 7 [3584/46250]	Loss: 0.0601	LR: 0.020000
Training Epoch: 7 [3840/46250]	Loss: 0.0707	LR: 0.020000
Training Epoch: 7 [4096/46250]	Loss: 0.0823	LR: 0.020000
Training Epoch: 7 [4352/46250]	Loss: 0.0646	LR: 0.020000
Training Epoch: 7 [4608/46250]	Loss: 0.0454	LR: 0.020000
Training Epoch: 7 [4864/46250]	Loss: 0.0903	LR: 0.020000
Training Epoch: 7 [5120/46250]	Loss: 0.0472	LR: 0.020000
Training Epoch: 7 [5376/46250]	Loss: 0.0671	LR: 0.020000
Training Epoch: 7 [5632/46250]	Loss: 0.0491	LR: 0.020000
Training Epoch: 7 [5888/46250]	Loss: 0.0853	LR: 0.020000
Training Epoch: 7 [6144/46250]	Loss: 0.1301	LR: 0.020000
Training Epoch: 7 [6400/46250]	Loss: 0.0811	LR: 0.020000
Training Epoch: 7 [6656/46250]	Loss: 0.0720	LR: 0.020000
Training Epoch: 7 [6912/46250]	Loss: 0.0495	LR: 0.020000
Training Epoch: 7 [7168/46250]	Loss: 0.0964	LR: 0.020000
Training Epoch: 7 [7424/46250]	Loss: 0.0830	LR: 0.020000
Training Epoch: 7 [7680/46250]	Loss: 0.0675	LR: 0.020000
Training Epoch: 7 [7936/46250]	Loss: 0.0440	LR: 0.020000
Training Epoch: 7 [8192/46250]	Loss: 0.0927	LR: 0.020000
Training Epoch: 7 [8448/46250]	Loss: 0.0479	LR: 0.020000
Training Epoch: 7 [8704/46250]	Loss: 0.1267	LR: 0.020000
Training Epoch: 7 [8960/46250]	Loss: 0.0363	LR: 0.020000
Training Epoch: 7 [9216/46250]	Loss: 0.1037	LR: 0.020000
Training Epoch: 7 [9472/46250]	Loss: 0.0813	LR: 0.020000
Training Epoch: 7 [9728/46250]	Loss: 0.0827	LR: 0.020000
Training Epoch: 7 [9984/46250]	Loss: 0.0415	LR: 0.020000
Training Epoch: 7 [10240/46250]	Loss: 0.0561	LR: 0.020000
Training Epoch: 7 [10496/46250]	Loss: 0.0449	LR: 0.020000
Training Epoch: 7 [10752/46250]	Loss: 0.0638	LR: 0.020000
Training Epoch: 7 [11008/46250]	Loss: 0.0431	LR: 0.020000
Training Epoch: 7 [11264/46250]	Loss: 0.0610	LR: 0.020000
Training Epoch: 7 [11520/46250]	Loss: 0.0719	LR: 0.020000
Training Epoch: 7 [11776/46250]	Loss: 0.0828	LR: 0.020000
Training Epoch: 7 [12032/46250]	Loss: 0.0611	LR: 0.020000
Training Epoch: 7 [12288/46250]	Loss: 0.0561	LR: 0.020000
Training Epoch: 7 [12544/46250]	Loss: 0.0547	LR: 0.020000
Training Epoch: 7 [12800/46250]	Loss: 0.0602	LR: 0.020000
Training Epoch: 7 [13056/46250]	Loss: 0.0364	LR: 0.020000
Training Epoch: 7 [13312/46250]	Loss: 0.0574	LR: 0.020000
Training Epoch: 7 [13568/46250]	Loss: 0.0790	LR: 0.020000
Training Epoch: 7 [13824/46250]	Loss: 0.0903	LR: 0.020000
Training Epoch: 7 [14080/46250]	Loss: 0.0694	LR: 0.020000
Training Epoch: 7 [14336/46250]	Loss: 0.0832	LR: 0.020000
Training Epoch: 7 [14592/46250]	Loss: 0.0347	LR: 0.020000
Training Epoch: 7 [14848/46250]	Loss: 0.0597	LR: 0.020000
Training Epoch: 7 [15104/46250]	Loss: 0.0657	LR: 0.020000
Training Epoch: 7 [15360/46250]	Loss: 0.0666	LR: 0.020000
Training Epoch: 7 [15616/46250]	Loss: 0.0554	LR: 0.020000
Training Epoch: 7 [15872/46250]	Loss: 0.0582	LR: 0.020000
Training Epoch: 7 [16128/46250]	Loss: 0.0401	LR: 0.020000
Training Epoch: 7 [16384/46250]	Loss: 0.0483	LR: 0.020000
Training Epoch: 7 [16640/46250]	Loss: 0.0471	LR: 0.020000
Training Epoch: 7 [16896/46250]	Loss: 0.0518	LR: 0.020000
Training Epoch: 7 [17152/46250]	Loss: 0.0284	LR: 0.020000
Training Epoch: 7 [17408/46250]	Loss: 0.0544	LR: 0.020000
Training Epoch: 7 [17664/46250]	Loss: 0.0834	LR: 0.020000
Training Epoch: 7 [17920/46250]	Loss: 0.0778	LR: 0.020000
Training Epoch: 7 [18176/46250]	Loss: 0.0670	LR: 0.020000
Training Epoch: 7 [18432/46250]	Loss: 0.0368	LR: 0.020000
Training Epoch: 7 [18688/46250]	Loss: 0.1111	LR: 0.020000
Training Epoch: 7 [18944/46250]	Loss: 0.0659	LR: 0.020000
Training Epoch: 7 [19200/46250]	Loss: 0.0266	LR: 0.020000
Training Epoch: 7 [19456/46250]	Loss: 0.0579	LR: 0.020000
Training Epoch: 7 [19712/46250]	Loss: 0.0426	LR: 0.020000
Training Epoch: 7 [19968/46250]	Loss: 0.0480	LR: 0.020000
Training Epoch: 7 [20224/46250]	Loss: 0.0734	LR: 0.020000
Training Epoch: 7 [20480/46250]	Loss: 0.0515	LR: 0.020000
Training Epoch: 7 [20736/46250]	Loss: 0.0583	LR: 0.020000
Training Epoch: 7 [20992/46250]	Loss: 0.0659	LR: 0.020000
Training Epoch: 7 [21248/46250]	Loss: 0.0398	LR: 0.020000
Training Epoch: 7 [21504/46250]	Loss: 0.0279	LR: 0.020000
Training Epoch: 7 [21760/46250]	Loss: 0.0551	LR: 0.020000
Training Epoch: 7 [22016/46250]	Loss: 0.0499	LR: 0.020000
Training Epoch: 7 [22272/46250]	Loss: 0.0465	LR: 0.020000
Training Epoch: 7 [22528/46250]	Loss: 0.0616	LR: 0.020000
Training Epoch: 7 [22784/46250]	Loss: 0.0129	LR: 0.020000
Training Epoch: 7 [23040/46250]	Loss: 0.1104	LR: 0.020000
Training Epoch: 7 [23296/46250]	Loss: 0.0323	LR: 0.020000
Training Epoch: 7 [23552/46250]	Loss: 0.0776	LR: 0.020000
Training Epoch: 7 [23808/46250]	Loss: 0.0913	LR: 0.020000
Training Epoch: 7 [24064/46250]	Loss: 0.0415	LR: 0.020000
Training Epoch: 7 [24320/46250]	Loss: 0.0460	LR: 0.020000
Training Epoch: 7 [24576/46250]	Loss: 0.0763	LR: 0.020000
Training Epoch: 7 [24832/46250]	Loss: 0.0284	LR: 0.020000
Training Epoch: 7 [25088/46250]	Loss: 0.0720	LR: 0.020000
Training Epoch: 7 [25344/46250]	Loss: 0.0489	LR: 0.020000
Training Epoch: 7 [25600/46250]	Loss: 0.0347	LR: 0.020000
Training Epoch: 7 [25856/46250]	Loss: 0.0195	LR: 0.020000
Training Epoch: 7 [26112/46250]	Loss: 0.0605	LR: 0.020000
Training Epoch: 7 [26368/46250]	Loss: 0.0571	LR: 0.020000
Training Epoch: 7 [26624/46250]	Loss: 0.0470	LR: 0.020000
Training Epoch: 7 [26880/46250]	Loss: 0.0397	LR: 0.020000
Training Epoch: 7 [27136/46250]	Loss: 0.0565	LR: 0.020000
Training Epoch: 7 [27392/46250]	Loss: 0.0813	LR: 0.020000
Training Epoch: 7 [27648/46250]	Loss: 0.0217	LR: 0.020000
Training Epoch: 7 [27904/46250]	Loss: 0.0572	LR: 0.020000
Training Epoch: 7 [28160/46250]	Loss: 0.0560	LR: 0.020000
Training Epoch: 7 [28416/46250]	Loss: 0.0690	LR: 0.020000
Training Epoch: 7 [28672/46250]	Loss: 0.0609	LR: 0.020000
Training Epoch: 7 [28928/46250]	Loss: 0.0452	LR: 0.020000
Training Epoch: 7 [29184/46250]	Loss: 0.0732	LR: 0.020000
Training Epoch: 7 [29440/46250]	Loss: 0.0194	LR: 0.020000
Training Epoch: 7 [29696/46250]	Loss: 0.0645	LR: 0.020000
Training Epoch: 7 [29952/46250]	Loss: 0.0819	LR: 0.020000
Training Epoch: 7 [30208/46250]	Loss: 0.0379	LR: 0.020000
Training Epoch: 7 [30464/46250]	Loss: 0.0367	LR: 0.020000
Training Epoch: 7 [30720/46250]	Loss: 0.0535	LR: 0.020000
Training Epoch: 7 [30976/46250]	Loss: 0.0299	LR: 0.020000
Training Epoch: 7 [31232/46250]	Loss: 0.0670	LR: 0.020000
Training Epoch: 7 [31488/46250]	Loss: 0.0401	LR: 0.020000
Training Epoch: 7 [31744/46250]	Loss: 0.0717	LR: 0.020000
Training Epoch: 7 [32000/46250]	Loss: 0.0348	LR: 0.020000
Training Epoch: 7 [32256/46250]	Loss: 0.0351	LR: 0.020000
Training Epoch: 7 [32512/46250]	Loss: 0.0464	LR: 0.020000
Training Epoch: 7 [32768/46250]	Loss: 0.0213	LR: 0.020000
Training Epoch: 7 [33024/46250]	Loss: 0.0344	LR: 0.020000
Training Epoch: 7 [33280/46250]	Loss: 0.0781	LR: 0.020000
Training Epoch: 7 [33536/46250]	Loss: 0.0305	LR: 0.020000
Training Epoch: 7 [33792/46250]	Loss: 0.0269	LR: 0.020000
Training Epoch: 7 [34048/46250]	Loss: 0.0573	LR: 0.020000
Training Epoch: 7 [34304/46250]	Loss: 0.0612	LR: 0.020000
Training Epoch: 7 [34560/46250]	Loss: 0.0460	LR: 0.020000
Training Epoch: 7 [34816/46250]	Loss: 0.0875	LR: 0.020000
Training Epoch: 7 [35072/46250]	Loss: 0.0550	LR: 0.020000
Training Epoch: 7 [35328/46250]	Loss: 0.0391	LR: 0.020000
Training Epoch: 7 [35584/46250]	Loss: 0.0853	LR: 0.020000
Training Epoch: 7 [35840/46250]	Loss: 0.0367	LR: 0.020000
Training Epoch: 7 [36096/46250]	Loss: 0.0463	LR: 0.020000
Training Epoch: 7 [36352/46250]	Loss: 0.0877	LR: 0.020000
Training Epoch: 7 [36608/46250]	Loss: 0.0449	LR: 0.020000
Training Epoch: 7 [36864/46250]	Loss: 0.0435	LR: 0.020000
Training Epoch: 7 [37120/46250]	Loss: 0.0619	LR: 0.020000
Training Epoch: 7 [37376/46250]	Loss: 0.0540	LR: 0.020000
Training Epoch: 7 [37632/46250]	Loss: 0.0841	LR: 0.020000
Training Epoch: 7 [37888/46250]	Loss: 0.0795	LR: 0.020000
Training Epoch: 7 [38144/46250]	Loss: 0.0702	LR: 0.020000
Training Epoch: 7 [38400/46250]	Loss: 0.0320	LR: 0.020000
Training Epoch: 7 [38656/46250]	Loss: 0.0242	LR: 0.020000
Training Epoch: 7 [38912/46250]	Loss: 0.0918	LR: 0.020000
Training Epoch: 7 [39168/46250]	Loss: 0.0974	LR: 0.020000
Training Epoch: 7 [39424/46250]	Loss: 0.0475	LR: 0.020000
Training Epoch: 7 [39680/46250]	Loss: 0.0437	LR: 0.020000
Training Epoch: 7 [39936/46250]	Loss: 0.0755	LR: 0.020000
Training Epoch: 7 [40192/46250]	Loss: 0.0733	LR: 0.020000
Training Epoch: 7 [40448/46250]	Loss: 0.0299	LR: 0.020000
Training Epoch: 7 [40704/46250]	Loss: 0.0372	LR: 0.020000
Training Epoch: 7 [40960/46250]	Loss: 0.0469	LR: 0.020000
Training Epoch: 7 [41216/46250]	Loss: 0.0945	LR: 0.020000
Training Epoch: 7 [41472/46250]	Loss: 0.0277	LR: 0.020000
Training Epoch: 7 [41728/46250]	Loss: 0.0630	LR: 0.020000
Training Epoch: 7 [41984/46250]	Loss: 0.0106	LR: 0.020000
Training Epoch: 7 [42240/46250]	Loss: 0.0354	LR: 0.020000
Training Epoch: 7 [42496/46250]	Loss: 0.0101	LR: 0.020000
Training Epoch: 7 [42752/46250]	Loss: 0.0224	LR: 0.020000
Training Epoch: 7 [43008/46250]	Loss: 0.0449	LR: 0.020000
Training Epoch: 7 [43264/46250]	Loss: 0.0269	LR: 0.020000
Training Epoch: 7 [43520/46250]	Loss: 0.0961	LR: 0.020000
Training Epoch: 7 [43776/46250]	Loss: 0.0256	LR: 0.020000
Training Epoch: 7 [44032/46250]	Loss: 0.0383	LR: 0.020000
Training Epoch: 7 [44288/46250]	Loss: 0.0392	LR: 0.020000
Training Epoch: 7 [44544/46250]	Loss: 0.0402	LR: 0.020000
Training Epoch: 7 [44800/46250]	Loss: 0.0399	LR: 0.020000
Training Epoch: 7 [45056/46250]	Loss: 0.0741	LR: 0.020000
Training Epoch: 7 [45312/46250]	Loss: 0.0639	LR: 0.020000
Training Epoch: 7 [45568/46250]	Loss: 0.0829	LR: 0.020000
Training Epoch: 7 [45824/46250]	Loss: 0.0532	LR: 0.020000
Training Epoch: 7 [46080/46250]	Loss: 0.0285	LR: 0.020000
Training Epoch: 7 [46250/46250]	Loss: 0.0691	LR: 0.020000
Epoch 7 - Average Train Loss: 0.0614, Train Accuracy: 0.9787
Epoch 7 training time consumed: 334.99s
Evaluating Network.....
Test set: Epoch: 7, Average loss: 0.0004, Accuracy: 0.9729, Time consumed: 23.56s
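The reported test "Average loss" values (around 0.0003-0.0007) are far smaller than the per-batch training losses, which is consistent with summing per-batch mean losses and then dividing by the dataset size rather than the batch count; that scaling is an inference from the magnitudes, not confirmed from source. A sketch of such an evaluation routine:

```python
import time
import torch
import torch.nn.functional as F

@torch.no_grad()
def evaluate(model, loader, epoch, device="cuda"):
    """Test-set evaluation printing in the format of the log above."""
    model.eval()
    start = time.time()
    loss_sum, correct = 0.0, 0
    for images, labels in loader:
        images, labels = images.to(device), labels.to(device)
        logits = model(images)
        loss_sum += F.cross_entropy(logits, labels).item()  # per-batch mean
        correct += (logits.argmax(dim=1) == labels).sum().item()
    n = len(loader.dataset)
    # Dividing summed batch-mean losses by n yields the small values seen above.
    print(f"Test set: Epoch: {epoch}, Average loss: {loss_sum / n:.4f}, "
          f"Accuracy: {correct / n:.4f}, Time consumed: {time.time() - start:.2f}s")
    return correct / n
```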
Saving weights file to checkpoint/retrain/ViT/Friday_18_July_2025_00h_08m_56s/ViT-Cifar10-seed5-ret25-7-best.pth
Training Epoch: 8 [256/46250]	Loss: 0.0540	LR: 0.020000
Training Epoch: 8 [512/46250]	Loss: 0.0307	LR: 0.020000
Training Epoch: 8 [768/46250]	Loss: 0.0153	LR: 0.020000
Training Epoch: 8 [1024/46250]	Loss: 0.0712	LR: 0.020000
Training Epoch: 8 [1280/46250]	Loss: 0.0920	LR: 0.020000
Training Epoch: 8 [1536/46250]	Loss: 0.0320	LR: 0.020000
Training Epoch: 8 [1792/46250]	Loss: 0.0313	LR: 0.020000
Training Epoch: 8 [2048/46250]	Loss: 0.0318	LR: 0.020000
Training Epoch: 8 [2304/46250]	Loss: 0.0325	LR: 0.020000
Training Epoch: 8 [2560/46250]	Loss: 0.0275	LR: 0.020000
Training Epoch: 8 [2816/46250]	Loss: 0.0363	LR: 0.020000
Training Epoch: 8 [3072/46250]	Loss: 0.0317	LR: 0.020000
Training Epoch: 8 [3328/46250]	Loss: 0.0346	LR: 0.020000
Training Epoch: 8 [3584/46250]	Loss: 0.0131	LR: 0.020000
Training Epoch: 8 [3840/46250]	Loss: 0.0434	LR: 0.020000
Training Epoch: 8 [4096/46250]	Loss: 0.0318	LR: 0.020000
Training Epoch: 8 [4352/46250]	Loss: 0.0367	LR: 0.020000
Training Epoch: 8 [4608/46250]	Loss: 0.0385	LR: 0.020000
Training Epoch: 8 [4864/46250]	Loss: 0.0238	LR: 0.020000
Training Epoch: 8 [5120/46250]	Loss: 0.0231	LR: 0.020000
Training Epoch: 8 [5376/46250]	Loss: 0.0324	LR: 0.020000
Training Epoch: 8 [5632/46250]	Loss: 0.0234	LR: 0.020000
Training Epoch: 8 [5888/46250]	Loss: 0.0443	LR: 0.020000
Training Epoch: 8 [6144/46250]	Loss: 0.0404	LR: 0.020000
Training Epoch: 8 [6400/46250]	Loss: 0.0451	LR: 0.020000
Training Epoch: 8 [6656/46250]	Loss: 0.0504	LR: 0.020000
Training Epoch: 8 [6912/46250]	Loss: 0.0276	LR: 0.020000
Training Epoch: 8 [7168/46250]	Loss: 0.0163	LR: 0.020000
Training Epoch: 8 [7424/46250]	Loss: 0.0394	LR: 0.020000
Training Epoch: 8 [7680/46250]	Loss: 0.0360	LR: 0.020000
Training Epoch: 8 [7936/46250]	Loss: 0.0258	LR: 0.020000
Training Epoch: 8 [8192/46250]	Loss: 0.0572	LR: 0.020000
Training Epoch: 8 [8448/46250]	Loss: 0.0779	LR: 0.020000
Training Epoch: 8 [8704/46250]	Loss: 0.0440	LR: 0.020000
Training Epoch: 8 [8960/46250]	Loss: 0.0207	LR: 0.020000
Training Epoch: 8 [9216/46250]	Loss: 0.0136	LR: 0.020000
Training Epoch: 8 [9472/46250]	Loss: 0.0458	LR: 0.020000
Training Epoch: 8 [9728/46250]	Loss: 0.0393	LR: 0.020000
Training Epoch: 8 [9984/46250]	Loss: 0.0449	LR: 0.020000
Training Epoch: 8 [10240/46250]	Loss: 0.0411	LR: 0.020000
Training Epoch: 8 [10496/46250]	Loss: 0.0436	LR: 0.020000
Training Epoch: 8 [10752/46250]	Loss: 0.0407	LR: 0.020000
Training Epoch: 8 [11008/46250]	Loss: 0.0103	LR: 0.020000
Training Epoch: 8 [11264/46250]	Loss: 0.0418	LR: 0.020000
Training Epoch: 8 [11520/46250]	Loss: 0.0221	LR: 0.020000
Training Epoch: 8 [11776/46250]	Loss: 0.0186	LR: 0.020000
Training Epoch: 8 [12032/46250]	Loss: 0.0747	LR: 0.020000
Training Epoch: 8 [12288/46250]	Loss: 0.0505	LR: 0.020000
Training Epoch: 8 [12544/46250]	Loss: 0.0381	LR: 0.020000
Training Epoch: 8 [12800/46250]	Loss: 0.0357	LR: 0.020000
Training Epoch: 8 [13056/46250]	Loss: 0.0152	LR: 0.020000
Training Epoch: 8 [13312/46250]	Loss: 0.0283	LR: 0.020000
Training Epoch: 8 [13568/46250]	Loss: 0.0104	LR: 0.020000
Training Epoch: 8 [13824/46250]	Loss: 0.0341	LR: 0.020000
Training Epoch: 8 [14080/46250]	Loss: 0.0112	LR: 0.020000
Training Epoch: 8 [14336/46250]	Loss: 0.0350	LR: 0.020000
Training Epoch: 8 [14592/46250]	Loss: 0.0437	LR: 0.020000
Training Epoch: 8 [14848/46250]	Loss: 0.0342	LR: 0.020000
Training Epoch: 8 [15104/46250]	Loss: 0.0301	LR: 0.020000
Training Epoch: 8 [15360/46250]	Loss: 0.0326	LR: 0.020000
Training Epoch: 8 [15616/46250]	Loss: 0.0221	LR: 0.020000
Training Epoch: 8 [15872/46250]	Loss: 0.0658	LR: 0.020000
Training Epoch: 8 [16128/46250]	Loss: 0.0260	LR: 0.020000
Training Epoch: 8 [16384/46250]	Loss: 0.0309	LR: 0.020000
Training Epoch: 8 [16640/46250]	Loss: 0.0219	LR: 0.020000
Training Epoch: 8 [16896/46250]	Loss: 0.0461	LR: 0.020000
Training Epoch: 8 [17152/46250]	Loss: 0.0366	LR: 0.020000
Training Epoch: 8 [17408/46250]	Loss: 0.0198	LR: 0.020000
Training Epoch: 8 [17664/46250]	Loss: 0.0427	LR: 0.020000
Training Epoch: 8 [17920/46250]	Loss: 0.0281	LR: 0.020000
Training Epoch: 8 [18176/46250]	Loss: 0.0274	LR: 0.020000
Training Epoch: 8 [18432/46250]	Loss: 0.0515	LR: 0.020000
Training Epoch: 8 [18688/46250]	Loss: 0.0589	LR: 0.020000
Training Epoch: 8 [18944/46250]	Loss: 0.0273	LR: 0.020000
Training Epoch: 8 [19200/46250]	Loss: 0.0693	LR: 0.020000
Training Epoch: 8 [19456/46250]	Loss: 0.0443	LR: 0.020000
Training Epoch: 8 [19712/46250]	Loss: 0.0198	LR: 0.020000
Training Epoch: 8 [19968/46250]	Loss: 0.0448	LR: 0.020000
Training Epoch: 8 [20224/46250]	Loss: 0.0825	LR: 0.020000
Training Epoch: 8 [20480/46250]	Loss: 0.0294	LR: 0.020000
Training Epoch: 8 [20736/46250]	Loss: 0.0384	LR: 0.020000
Training Epoch: 8 [20992/46250]	Loss: 0.0182	LR: 0.020000
Training Epoch: 8 [21248/46250]	Loss: 0.0847	LR: 0.020000
Training Epoch: 8 [21504/46250]	Loss: 0.0374	LR: 0.020000
Training Epoch: 8 [21760/46250]	Loss: 0.0226	LR: 0.020000
Training Epoch: 8 [22016/46250]	Loss: 0.0304	LR: 0.020000
Training Epoch: 8 [22272/46250]	Loss: 0.0526	LR: 0.020000
Training Epoch: 8 [22528/46250]	Loss: 0.0908	LR: 0.020000
Training Epoch: 8 [22784/46250]	Loss: 0.0582	LR: 0.020000
Training Epoch: 8 [23040/46250]	Loss: 0.0623	LR: 0.020000
Training Epoch: 8 [23296/46250]	Loss: 0.0195	LR: 0.020000
Training Epoch: 8 [23552/46250]	Loss: 0.0441	LR: 0.020000
Training Epoch: 8 [23808/46250]	Loss: 0.0309	LR: 0.020000
Training Epoch: 8 [24064/46250]	Loss: 0.0388	LR: 0.020000
Training Epoch: 8 [24320/46250]	Loss: 0.0312	LR: 0.020000
Training Epoch: 8 [24576/46250]	Loss: 0.0227	LR: 0.020000
Training Epoch: 8 [24832/46250]	Loss: 0.0440	LR: 0.020000
Training Epoch: 8 [25088/46250]	Loss: 0.0490	LR: 0.020000
Training Epoch: 8 [25344/46250]	Loss: 0.0184	LR: 0.020000
Training Epoch: 8 [25600/46250]	Loss: 0.0471	LR: 0.020000
Training Epoch: 8 [25856/46250]	Loss: 0.0154	LR: 0.020000
Training Epoch: 8 [26112/46250]	Loss: 0.0413	LR: 0.020000
Training Epoch: 8 [26368/46250]	Loss: 0.0436	LR: 0.020000
Training Epoch: 8 [26624/46250]	Loss: 0.0283	LR: 0.020000
Training Epoch: 8 [26880/46250]	Loss: 0.0503	LR: 0.020000
Training Epoch: 8 [27136/46250]	Loss: 0.0520	LR: 0.020000
Training Epoch: 8 [27392/46250]	Loss: 0.0494	LR: 0.020000
Training Epoch: 8 [27648/46250]	Loss: 0.0455	LR: 0.020000
Training Epoch: 8 [27904/46250]	Loss: 0.0399	LR: 0.020000
Training Epoch: 8 [28160/46250]	Loss: 0.0460	LR: 0.020000
Training Epoch: 8 [28416/46250]	Loss: 0.0674	LR: 0.020000
Training Epoch: 8 [28672/46250]	Loss: 0.0230	LR: 0.020000
Training Epoch: 8 [28928/46250]	Loss: 0.0226	LR: 0.020000
Training Epoch: 8 [29184/46250]	Loss: 0.0595	LR: 0.020000
Training Epoch: 8 [29440/46250]	Loss: 0.0329	LR: 0.020000
Training Epoch: 8 [29696/46250]	Loss: 0.0572	LR: 0.020000
Training Epoch: 8 [29952/46250]	Loss: 0.0226	LR: 0.020000
Training Epoch: 8 [30208/46250]	Loss: 0.0284	LR: 0.020000
Training Epoch: 8 [30464/46250]	Loss: 0.0326	LR: 0.020000
Training Epoch: 8 [30720/46250]	Loss: 0.0520	LR: 0.020000
Training Epoch: 8 [30976/46250]	Loss: 0.0378	LR: 0.020000
Training Epoch: 8 [31232/46250]	Loss: 0.0295	LR: 0.020000
Training Epoch: 8 [31488/46250]	Loss: 0.0146	LR: 0.020000
Training Epoch: 8 [31744/46250]	Loss: 0.0838	LR: 0.020000
Training Epoch: 8 [32000/46250]	Loss: 0.0443	LR: 0.020000
Training Epoch: 8 [32256/46250]	Loss: 0.0515	LR: 0.020000
Training Epoch: 8 [32512/46250]	Loss: 0.0592	LR: 0.020000
Training Epoch: 8 [32768/46250]	Loss: 0.0420	LR: 0.020000
Training Epoch: 8 [33024/46250]	Loss: 0.0445	LR: 0.020000
Training Epoch: 8 [33280/46250]	Loss: 0.0515	LR: 0.020000
Training Epoch: 8 [33536/46250]	Loss: 0.0240	LR: 0.020000
Training Epoch: 8 [33792/46250]	Loss: 0.0143	LR: 0.020000
Training Epoch: 8 [34048/46250]	Loss: 0.0145	LR: 0.020000
Training Epoch: 8 [34304/46250]	Loss: 0.0740	LR: 0.020000
Training Epoch: 8 [34560/46250]	Loss: 0.0467	LR: 0.020000
Training Epoch: 8 [34816/46250]	Loss: 0.0425	LR: 0.020000
Training Epoch: 8 [35072/46250]	Loss: 0.0409	LR: 0.020000
Training Epoch: 8 [35328/46250]	Loss: 0.0532	LR: 0.020000
Training Epoch: 8 [35584/46250]	Loss: 0.0456	LR: 0.020000
Training Epoch: 8 [35840/46250]	Loss: 0.0314	LR: 0.020000
Training Epoch: 8 [36096/46250]	Loss: 0.0536	LR: 0.020000
Training Epoch: 8 [36352/46250]	Loss: 0.0163	LR: 0.020000
Training Epoch: 8 [36608/46250]	Loss: 0.0301	LR: 0.020000
Training Epoch: 8 [36864/46250]	Loss: 0.0168	LR: 0.020000
Training Epoch: 8 [37120/46250]	Loss: 0.0282	LR: 0.020000
Training Epoch: 8 [37376/46250]	Loss: 0.0324	LR: 0.020000
Training Epoch: 8 [37632/46250]	Loss: 0.0096	LR: 0.020000
Training Epoch: 8 [37888/46250]	Loss: 0.0322	LR: 0.020000
Training Epoch: 8 [38144/46250]	Loss: 0.0380	LR: 0.020000
Training Epoch: 8 [38400/46250]	Loss: 0.0294	LR: 0.020000
Training Epoch: 8 [38656/46250]	Loss: 0.0309	LR: 0.020000
Training Epoch: 8 [38912/46250]	Loss: 0.0308	LR: 0.020000
Training Epoch: 8 [39168/46250]	Loss: 0.0615	LR: 0.020000
Training Epoch: 8 [39424/46250]	Loss: 0.0230	LR: 0.020000
Training Epoch: 8 [39680/46250]	Loss: 0.0845	LR: 0.020000
Training Epoch: 8 [39936/46250]	Loss: 0.0753	LR: 0.020000
Training Epoch: 8 [40192/46250]	Loss: 0.0160	LR: 0.020000
Training Epoch: 8 [40448/46250]	Loss: 0.0324	LR: 0.020000
Training Epoch: 8 [40704/46250]	Loss: 0.0659	LR: 0.020000
Training Epoch: 8 [40960/46250]	Loss: 0.0739	LR: 0.020000
Training Epoch: 8 [41216/46250]	Loss: 0.0366	LR: 0.020000
Training Epoch: 8 [41472/46250]	Loss: 0.0491	LR: 0.020000
Training Epoch: 8 [41728/46250]	Loss: 0.0375	LR: 0.020000
Training Epoch: 8 [41984/46250]	Loss: 0.0241	LR: 0.020000
Training Epoch: 8 [42240/46250]	Loss: 0.0388	LR: 0.020000
Training Epoch: 8 [42496/46250]	Loss: 0.0479	LR: 0.020000
Training Epoch: 8 [42752/46250]	Loss: 0.0318	LR: 0.020000
Training Epoch: 8 [43008/46250]	Loss: 0.0549	LR: 0.020000
Training Epoch: 8 [43264/46250]	Loss: 0.0290	LR: 0.020000
Training Epoch: 8 [43520/46250]	Loss: 0.0230	LR: 0.020000
Training Epoch: 8 [43776/46250]	Loss: 0.0639	LR: 0.020000
Training Epoch: 8 [44032/46250]	Loss: 0.0729	LR: 0.020000
Training Epoch: 8 [44288/46250]	Loss: 0.0485	LR: 0.020000
Training Epoch: 8 [44544/46250]	Loss: 0.0605	LR: 0.020000
Training Epoch: 8 [44800/46250]	Loss: 0.0250	LR: 0.020000
Training Epoch: 8 [45056/46250]	Loss: 0.0462	LR: 0.020000
Training Epoch: 8 [45312/46250]	Loss: 0.0154	LR: 0.020000
Training Epoch: 8 [45568/46250]	Loss: 0.0786	LR: 0.020000
Training Epoch: 8 [45824/46250]	Loss: 0.0276	LR: 0.020000
Training Epoch: 8 [46080/46250]	Loss: 0.0702	LR: 0.020000
Training Epoch: 8 [46250/46250]	Loss: 0.0340	LR: 0.020000
Epoch 8 - Average Train Loss: 0.0394, Train Accuracy: 0.9860
Epoch 8 training time consumed: 334.06s
Evaluating Network.....
Test set: Epoch: 8, Average loss: 0.0003, Accuracy: 0.9752, Time consumed: 23.56s
Saving weights file to checkpoint/retrain/ViT/Friday_18_July_2025_00h_08m_56s/ViT-Cifar10-seed5-ret25-8-best.pth
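Weights files are written only on epochs where test accuracy improves (epochs 7 and 8 here), to a timestamped run directory. A sketch of that best-checkpoint logic; the path template mirrors the log lines, while the function and argument names are illustrative:

```python
import os
import torch

def save_if_best(model, test_acc, best_acc, epoch, ckpt_dir, seed=5, ret=25):
    """Save a checkpoint when test accuracy reaches a new best."""
    if test_acc <= best_acc:
        return best_acc
    os.makedirs(ckpt_dir, exist_ok=True)
    path = os.path.join(ckpt_dir, f"ViT-Cifar10-seed{seed}-ret{ret}-{epoch}-best.pth")
    print(f"Saving weights file to {path}")
    torch.save(model.state_dict(), path)
    return test_acc
```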
Valid (Test) DataLoader: 10000 samples
Train DataLoader: 50000 samples
Retain Train DataLoader: 46250 samples
Forget Train DataLoader: 3750 samples
Retain Valid DataLoader: 46250 samples
Forget Valid DataLoader: 3750 samples
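The sizes above imply the 50000 CIFAR-10 training images were split into 46250 retain and 3750 forget samples, i.e. 375 per class under a class-balanced split, seeded for reproducibility (seed 5 appears in the checkpoint names). A sketch of one way such a split could be produced; the function name and the balanced-per-class assumption are mine:

```python
import numpy as np
from torch.utils.data import Subset

def split_retain_forget(dataset, forget_per_class=375, num_classes=10, seed=5):
    """Seeded, class-balanced retain/forget split (assumed implementation)."""
    rng = np.random.default_rng(seed)
    labels = np.asarray(dataset.targets)       # torchvision CIFAR10 exposes .targets
    forget_idx = []
    for c in range(num_classes):
        cls_idx = np.flatnonzero(labels == c)
        forget_idx.extend(rng.choice(cls_idx, size=forget_per_class, replace=False))
    forget_set = set(forget_idx)
    retain_idx = [i for i in range(len(dataset)) if i not in forget_set]
    return Subset(dataset, retain_idx), Subset(dataset, sorted(forget_set))
```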
retain_prob Distribution: 10000 samples
test_prob Distribution: 10000 samples
forget_prob Distribution: 3750 samples
Set1 Distribution: 3750 samples
Set2 Distribution: 3750 samples
Set1 Distribution: 3750 samples
Set2 Distribution: 3750 samples
Set1 Distribution: 10000 samples
Set2 Distribution: 10000 samples
Set1 Distribution: 10000 samples
Set2 Distribution: 10000 samples
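The *_prob lines indicate softmax output distributions are collected per split before computing the metrics below; retain_prob covers 10000 samples, matching the test-set size, which suggests a balanced subsample of the 46250-sample retain set (an inference). A sketch of the collection step:

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def collect_probs(model, loader, device="cuda"):
    """Stack softmax outputs for every sample served by a loader."""
    model.eval()
    probs = [F.softmax(model(x.to(device)), dim=1).cpu() for x, _ in loader]
    return torch.cat(probs)                 # shape: [num_samples, num_classes]

# e.g. forget_prob = collect_probs(model, forget_loader)   # -> 3750 x 10
```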
Test Accuracy: 97.5781%
Retain Accuracy: 99.1281%
Zero Retrain Forgetting (ZRF): 0.7665
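ZRF is commonly defined (Chundawat et al., "Can Bad Teaching Induce Forgetting?", AAAI 2023) as one minus the mean Jensen-Shannon divergence between the evaluated model's and a freshly initialized model's output distributions on the forget set; 1.0 would mean the model behaves as if it never saw that data. A sketch under that assumed definition:

```python
import torch

def js_divergence(p, q, eps=1e-12):
    """Jensen-Shannon divergence between two batches of distributions."""
    m = 0.5 * (p + q)
    kl = lambda a, b: (a * ((a + eps) / (b + eps)).log()).sum(dim=1)
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

def zrf(unlearned_probs, random_model_probs):
    """ZRF = 1 - mean JS divergence against a randomly initialized model."""
    return 1.0 - js_divergence(unlearned_probs, random_model_probs).mean().item()
```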
Membership Inference Attack (MIA): 0.8451
Forget vs Retain Membership Inference Attack (MIA): 0.4987
Forget vs Test Membership Inference Attack (MIA): 0.5127
Test vs Retain Membership Inference Attack (MIA): 0.5008
Train vs Test Membership Inference Attack (MIA): 0.5168
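A standard construction for these pairwise scores, assumed here, labels Set1 as members and Set2 as non-members, fits a simple classifier on per-sample confidence features, and reports its cross-validated accuracy. Values near 0.5, as in the forget-vs-retain and forget-vs-test rows, mean the attacker cannot tell the sets apart, which is the desired outcome for a retrained model. The feature choice (max softmax confidence) is illustrative; implementations often use entropy or loss instead:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

def mia_score(set1_probs: np.ndarray, set2_probs: np.ndarray) -> float:
    """Attack accuracy for distinguishing two sets of output distributions."""
    x1 = set1_probs.max(axis=1, keepdims=True)   # "member" confidences
    x2 = set2_probs.max(axis=1, keepdims=True)   # "non-member" confidences
    X = np.concatenate([x1, x2])
    y = np.concatenate([np.ones(len(x1)), np.zeros(len(x2))])
    return cross_val_score(LogisticRegression(), X, y, cv=5).mean()
```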
Forget Set Accuracy (Df): 97.0008%
Method Execution Time: 5289.47 seconds
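The final figure presumably times the full retrain-and-evaluate pipeline end to end; a trivial sketch:

```python
import time

start = time.time()
# ... retrain on the retain set for 8 epochs, then run all evaluations ...
print(f"Method Execution Time: {time.time() - start:.2f} seconds")
```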
